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SEQUENCE LISTING 



Sequence No . : 1 
Sequence length: 154 
Sequence type: Amino acid 
Topology: Linear 
Sequence kind: Protein 
aypotnecicai : wo 
Original source: 

Organism species: Homo sapiens 

Cell kind: Fibrosarcoma Is 

Cell line: HT-1080 

Clone name: HP00658 
Sequence description 



Met Lys Val Ser Ala Ala Ala Leu Ala'Val He Leu He Ala Thr Ala 



10 



15 



Leu Cys Ala Pro Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thr Thr Pro 



20 



25 



30 



Cys Cys Phe Ala Tyr He Ala Arg Pro Leu Pro Arg Ala His He Lys 



35 



AO 



A5 



Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala Val Val His 



50 



55 



60 



Arg Ser Arg Met Pro Lys Arg Glu Gly Gin Gin Val Trp Gin Asp Phe 



65 



70 



75 



80 



Leu Tyr Asp Ser Arg Leu Asn Lys Gly Lys Leu Cys His Pro Lys Glu 



85 



90 



95 



Pro Pro Ser Val Cys Gin Pro Arg Glu Glu Met Gly Ser Gly Val His 



1 on 



1 



no 
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115 120 125 

Thr Gin He Cys Leu Phe Leu Leu Ala Leu Val Leu Ala Trp Glu Ala 

130 135 140 

Ser Pro His Tyr Pro, Thr Pro Pro Ala Pro 
145 \50 



Sequence No. : 2 
Sequence length: 315 
Sequence type: Amino acid 
Topology: Linear 

Sequence kind: Protein 

t 

Hypothetical: No 
Original source: 

Organism species: Homo sapiens 

Cell kind: Epidermoid carcinoma 

Cell line: KB 

Clone name: HP00714 
Sequence description 

Met Asp Leu Arg Gin Phe Leu Met Cys Leu Ser Leu Cys Thr Ala Phe 

1 5 10 15 

Ala Leu Ser Lys Pro Thr Glu Lys Lys Asp Arg Val His His Glu Pro 

20 25 30 

Gin Leu Ser Asp Lys Val His Asn Asp Ala Gin Ser Phe Asp Tyr Asp 

35 40 45 

His Asp Ala Phe Leu Gly Ala Glu Glu Ala Lys Thr Phe Asp Gin Leu 

50 55 60 

Thr Pro Glu Glu Ser Lys Glu Arg Leu Gly Lys He Val Ser Lys He 
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66 



85 90 95 

He Lys Phe Ala Gin Lys Arg Trp He Tyr Glu Asp Val Glu Arg Gin 

10,0 105 11° 

Trp Lys Gly His Asp Leu Asn Glu Asp Gly Leu Val Ser Trp Glu Glu 

115 \ 120 I 25 

Tyr Lys Asn Ala Thr Tyr Gly Tyr Val Leu Asp Asp Pro Asp Pro Asp 

130 135 1*0 

Asp Gly Phe Asn Tyr Lys Gin Met Met Val Arg Asp Glu Arg Arg Phe 
145 150 155 160 

Lys Met Ala Asp Lys Asp Gly, Asp Leu lie Ala Thr Lys Glu Glu Phe 

165 



/ 



170 



17^ 



Thr Ala Phe Leu His Pro Glu Glu Tyr Asp Tyr Met Lys Asp He Val 

180 185 190 

Val Gin Glu Thr Met Glu Asp He Asp Lys Asn Ala Asp Gly Phe lie 

195 200 205 

Asp Leu Glu Glu Tyr He Gly Asp Met Tyr Ser His Asp Gly Asn Thr 

210 215 220 

Asp Glu Pro Glu Trp Val Lys Thr Glu Arg Gltt Gin Phe Val Glu Phe 
225 230 235 2A0 

Arg Asp Lys Asn Arg Asp Gly Lys Met Asp Lys Glu Glu Thr Lys Asp 

245 250 



255 



Trp He Leu Pro Ser Asp Tyr Asp His Ala Glu Ala Glu Ala Arg His 

265 270 



260 



Leu Val Tyr Glu Ser Asp Gin Asn Lys Asp Gly Lys Leu Thr Lys Glu 



275 



280 



285 



Glu He Val Asp Lys Tyr Asp Leu Phe Val Gly Ser Gin Ala Thr Asp 
290 295 300 



WO 98/11217 




PCT/JP97/03239 



Sequence No. : 3 
Sequence length: 158 
Sequence type: Amino acid 
Topology: Linear 
Sequence kind: Protein 
Hypothetical: No 
Original source: 

Organism species: Eomo Sapiens 

Cell kind: Stomach cancer 

Clone name: HP00876 
Sequence description 

Met Ala Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu Ala 

15 10 15 

Lys Thr Gly Val Leu Gly Asp He He Met Arg Pro Ser Cys Ala Pro 

20 25 30 

Gly Trp Phe Tyr His Lys Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu 

35 40 45 

Arg Asn Trp Ser Asp Ala Glu Leu Glu Cys Gin Ser Tyr Gly Asn Gly 

50 55 60 

Ala His Leu Ala Ser He Leu Ser Leu Lys Glu Ala Ser Thr He Ala 
65 70 75 80 

Glu Tyr He Ser Gly Tyr Gin Arg Ser Gin Pro He Trp He Gly Leu 

85 90 95 

His Asp Pro Gin Lys Arg Gin Gin Trp Gin Trp He Asp Gly Ala Met 

100 105 HO 

Tyr Leu Tyr Arg Ser Trp Ser Gly Lys Ser Met Gly Gly Asn Lys His 
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68 

130 135 1A0 

Glu Cys Asn Lys Arg Gin His Phe Leu Cys Lys Tyr Arg Pro 
145 150 155 



Sequence No. : 4 
Sequence length: 376 
Sequence type: Amino acid 
Topology: lineal 
Sequence kind: Protein 
Hypothetical: No 1 

i 

Original source: 

Organism species: Homo sapiens 

Cell kind: Liver 

Clone name: HP01134 
Sequence description 

Met Val Trp Lys Val Ala Val Phe Leu Ser Val Ala Leu Gly He Gly 

x 5 10 15 

Ala Val Pro He Asp Asp Pro Glu Asp Gly Gly Lys His Trp Val Val 

20 25 30 

lie Val Ala Gly Ser Asn Gly Trp Tyr Asn Tyr Arg His Gin Ala Asp 

35 AO 45 

Ala Cys His Ala Tyr Gin He He His Arg Asn Gly He Pro Asp Glu 

50 55 60 

Gin He Val Val Met Met Tyr Asp Asp He Ala Tyr Ser Glu Asp Asn 
65 70 75 80 

Pro Thr Pro Gly He Val He Asn Arg Pro Asn Gly Thr Asp Val Tyr 

85 90 95 

rvv Vn 1 Pro T.vs Asp Tvr Thr Gly Glu Asp Val Thr Pro Gin Asn 
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Phe Leu Ala Val Leu Arg Gly Asp Ala Glu Ala Val Lys Gly lie Gly 

115 120 I 25 

Ser Gly Lys Val Leu Lys Ser Gly Pro Gin Asp His Val Phe He Tyr 

130 135 1*0 

Phe Thr Asp His Gly Ser Thr Gly He Leu Val Phe Pro Asn Glu Asp 
145 150 155 160 

Leu His Val Lys Asp Le,u Asn Glu Thr He His Tyr Met Tyr Lys His 

165 170 175 

Lys Met Tyr Arg Lys Met Val Phe Tyr He Glu Ala Cys Glu Ser Gly 

180 1 185 190 

I 

Ser Met Met Asn His Leu Pro\Asp Asn He Asn Val ryr Ala Thr Thr 

195 2Q0 205 

Ala Ala Asn Pro Arg Glu Ser Ser Tyr Ala Cys Tyr Tyr Asp Glu Lys 

210 215 220 

Arg Ser Thr Tyr Leu Gly Asp Trp Tyr Ser Val Asn Trp Met Glu Asp 
225 230 235 240 

Ser Asp Val Glu Asp Leu Thr Lys Glu Thr Leu His Lys Gin Tyr His 

2A5 250 255 

Leu Val Lys Ser His Thr Asn Thr Ser His Val Met Gin Tyr Gly Asn 

260 265 270 

Lys Thr He Ser Thr Met Lys Val Met Gin Phe Gin Gly Met Lys Arg 

275 280 285 

Lys Ala Ser Ser Pro Val Pro Leu Pro Pro Val Thr His Leu Asp Leu 

290 295 300 

Thr Pro Ser Pro Asp Val Pro Leu Thr He Met Lys Arg Lys Leu Met 
305 310 315 320 

Asn Thr Asn Asp Leu Glu Glu Ser Arg Gin Leu Thr Glu Glu He Gin 
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'340 345 350 

Asn Leu Cys Olu Lys Pro Tyr Pro Leu His Arg lie Lys Leu Ser Met 

355 360 365 

Asp His Val Cys teu Gly His Tyr 
370 375 

Sequence No. : 5 
Sequence length: 173 
Sequence type: Amino acid 
Topology: Linear 
Sequence kind: Protein 
Hypothetical: No * ^ 

Original source: 

Organism species: Homo sapiens 

Cell kind: Epidermoid carcinoma 

Cell line: KB 

Clone name: HP10029 
Sequence description 

Met Ala Ala Pro Ser Gly Gly Trp Asn Gly Val Arg Ala Ser Leu Trp 

15 10 15 

Ala Ala Leu Leu Leu Gly Ala Val Ala Leu Arg Pro Ala Glu Ala Val 

20 25 30 

Ser Glu Pro Thr Thr Val Ala Phe Asp Val Arg Pro Gly Gly Val Val 

35 40 45 

His Ser Phe Ser His Asn Val Gly Pro Gly Asp Lys Tyr Thr Cys Met 

50 55 60 

Phe Thr Tyr Ala Ser Gin Gly Gly Thr Asn Glu Gin Trp Gin Met Ser 
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85 90 95 

Pro Gin Gly Lys Ser Tyr Leu Tyr Phe Thr Gin Phe Lys Ala Glu Val 

100 105 110 

Arg Gly Ala Glu %le Glu Tyr Ala Met Ala Tyr Ser Lys Ala Ala Phe 

115 120 125 

Glu Arg Glu Ser Asp Val Pro Leu Lys Thr Glu Glu Phe Glu Val Thr 

130 135 140 

Lys Thr Ala Val Ala B^s Arg Pro Gly Ala Phe Lys Ala Glu Leu Ser 

145 150 155 160 

Lys Leu Val lie Val Ala Lys Ala Ser Arg Thr Glu Leu 

165 170 

I 

Sequence No. : 6 
Sequence length: 73 
Sequence type: Amino acid 
Topology: Linear 
Sequence kind: Protein 
Hypothetical: No 
Original source: 

Organism species: Homo sapiens 

Cell kind: Epidermoid carcinoma 

Cell line: KB 

Clone name: HP10189 
Sequence description 

Met Gly Val Lys Leu Glu lie Phe Arg Met He He Tyr Leu Thr Phe 

15 10 15 

Pro Val Ala Met Phe Trp Val Ser Asn Gin Ala Glu Trp Phe Glu Asp 
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72 

35 « * 5 

Glu lie Glu Glu Phe Lys Glu Arg Leu Arg Lys Arg Arg Glu Glu Lys 

50 55 60 

Leu Leu Arg Asp Ala Gin Gin Asn Ser 
65 70 



Sequence No. : 7 
Sequence lengtn: xx/* 
Sequence type: Amino acid 
Topology: Linear 

kind: Protein . 

Hypothetical: No 
Original source: 

Organism species: Homo sapiens 

Cell kind: Histiocyte lymphoma 

Cell line: U937 

Clone name: HP10269 



Sequence description 
Met Arg Pro Phe Phe Leu Leu Cys Phe Ala Leu Pro Gly Leu Leu His 

1 5 10 15 

Ala Gin Gin Ala Cys Ser Arg Gly Ala Cy 8 Tyr Pro Pro Val Gly Asp 

20 25 30 

Leu Leu Val Gly Arg Thr Arg Phe Leu Arg Ala Ser Ser Thr Cys Gly 

35 40 * 5 

Leu Thr Lys Pro Glu Thr Tyr Cys Thr Gin Tyr Gly Glu Trp Gin Met 

50 55 60 

Lys Cys Cys Lys Cys Asp Ser Arg Gin Pro His Asn Tyr Tyr Ser His 

7" 75 80 
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73 



85 9° 95 

Ser Gin Asn Asp Vai Asn Pro Val Ser Leu Gin Leu Asp Leu Asp Arg 

100 105 HO 

Arg Phe Gin Leu Gin Glu Val Met Met Glu Phe Gin Gly Pro Met Pro 

115 120 125 

Ala Gly Met Leu lie Glu Arg Ser Ser Asp Phe Gly Lys Thr Trp Arg 

J_ v 

Val Tyr Gin Tyr Leu Ala Ala Asp Cys Thr Ser Thr Phe Pro Arg Val 
1*5 

Arg Gin Gly Arg Pro Gin Ser Xtp Gin Asp Val Arg Cys Gin Ser Leu 

165 J 170 1/5 

Pro Gin Arg Pro Asn Ala Arg Leu Asn Gly Gly Lys Val Gin Leu Asn 

180 "0 
Leu Met Asp Leu Val Ser Gly He Pro Ala Thr Gin Ser Gin Lys lie 

195 200 205 

Gin Glu Val Gly Glu He Thr Asn Leu Arg Val Asn Phe Thr Arg Leu 

210 215 220 

Ala Pro Val Pro Gin Arg Gly Tyr His Pro Pro Ser Ala Tyr Tyr Ala 
225 230 235 

Val Ser Gin Leu Arg Leu Gin Gly Ser Cys Phe Cys His Gly His Ala 

2A5 250 255 

Asp Arg Cys Ala Pro Lys Pro Gly Ala Ser Ala Gly Pro Ser Thr Ala 

260 265 270 

Val Gin Val His Asp Val Cys Val Cys Gin His Asn Thr Ala Gly Pro 

275 280 285 

Asn Cys Glu Arg Cys Ala Pro Phe Tyr Asn Asn Arg Pro Trp Arg Pro 
290 295 



300 
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His Ser Glu Thr Cys His Phe Asp Pro Ala Val Phe Ala Ala Ser Gin 

325 330 335 

Gly Ala Tyr Gly Gly \^al Cys Asp Asn Cys Arg Asp His Thr Glu Gly 

340 345 350 

Lys Asn Cys Glu Arg Cys Gin Leu His Tyr Phe Arg Asn Arg Arg Pro 

355 360 365 

Gly Ala Ser He Gin Glu Thr Cys He Ser Cys Glu Cys Asp Pro Asp 

370 37$ 380 

Gly Ala Val Pro Gly Ala Pro Cys Asp Pro Val Thr Gly Gin Cys Val 

385 390 * 395 400 

JL 

Cys Lys Glu His Val Gin Gly 6lu Arg Cys Asp Leu uys Lys P^o Gly 



405 410 415 

Phe Thr Gly Leu Thr Tyr Ala Asn Pro Gin Gly Cys His Arg Cys Asp 

420 425 430 

Cys Asn He Leu Gly Ser Arg Arg Asp Met Pro Cys Asp Glu Glu Ser 

435 440 445 

Gly Arg Cys Leu Cys Leu Pro Asn Val Val Gly Pro Lys Cys Asp Gin 

450 455 460 

Cys Ala Pro Tyr His Trp Lys Leu Ala Ser Gly Gin Gly Cys Glu Pro 
465 470 475 480 

Cys Ala Cys Asp Pro His Asn Ser Leu Ser Pro Gin Cys Asn Gin Phe 

485 490 495 

Thr Gly Gin Cys Pro Cys Arg Glu Gly Phe Gly Gly Leu Met Cys Ser 

500 505 510 

Ala Ala Ala He Arg Gin Cys Pro Asp Arg Thr Tyr Gly Asp Val Ala 

515 520 525 

Thr Gly Cys Arg Ala Cys Asp Cys Asp Phe Arg Gly Thr Glu Gly Pro 
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75 



545 



550 



555 



560 



Gly Pro Arg Cys Asp Gin Cys Gin Arg Gly Tyr Cys Asn Arg Tyr Pro 

565 570 575 

Val Cys Val Ala Cys His Pro Cys Phe Gin Thr Tyr Asp Ala Asp Leu 

580 585 590 

Arg Glu Gin Ala Leu Arg Phe Gly Arg Leu Arg Asn Ala Thr Ala Ser 

595 600 605 

Leu Trp Ser Gly Pro Gly Leu Glu Asp Arg Gly Leu Ala Ser Arg He 

610 615 620 

Leu Asp Ala Lys Ser Lys He Glu, Gin He Arg Ala Val Leu Ser Ser 



625 



630 



635 



Pro Ala Val Thr Glu Gin Glu Val Ala Gin Val Ala Ser Ala lie Leu 



645 



650 



655 



Ser Leu Arg Arg Thr Leu Gin Gly Leu Gin Leu Asp Leu Pro Leu Glu 

660 665 670 

Glu Glu Thr Leu Ser Leu Pro Arg Asp Leu Glu Ser Leu Asp Arg Ser 



675 



680 



685 



Phe Asn Gly Leu Leu Thr Met Tyr Gin Arg Lys Arg Glu Gin Phe Glu 



690 



695 



700 



Lys He Ser Ser Ala Asp Pro Ser Gly Ala Phe Arg Met Leu Ser Thr 
705 710 715 720 

Ala Tyr Glu Gin Ser Ala Gin Ala Ala Gin Gin Val Ser Asp Ser Ser 

725 730 735 

Arg Leu Leu Asp Gin Leu Arg Asp Ser Arg Arg Glu Ala Glu Arg Leu 

740 745 750 

Val Arg Gin Ala Gly Gly Gly Gly Gly Thr Gly Ser Pro Lys Leu Val 
755 760 765 
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Asn Lys Leu Cys Gly Asn Ser Arg Gin Met Ala Cys Thr Pro lie Ser 
7B5 790 795 800 

Cys Pro Gly Glu Leu Cys Pro Gin Asp Asn Gly Thr Ala Cys Gly Ser 

805 810 815 

Arg Cys Arg Gly Val Leu Pro Arg Ala Gly Gly Ala Phe Leu Met Ala 

820 825 830 

Gly Gin Val Ala Glu Gin Leu Arg Gly Phe Asn Ala Gin Leu Gin Arg 



835 



840 



OH J 



Thr Arg Gin Met He Arg Ala Ala Glu Glu Ser Ala Ser Gin He Gin 



850 



855 



860 



Ser Ser Ala Gin Arg Leu Glu Thr Cln Val S^r Ala Ser Arg Ser Gin 



865 



870 



875 



880 



Met Glu Glu Asp Val Arg Arg Thr Arg Leu Leu He Gin Gin Val Arg 



885 



890 



895 



Asp Phe Leu Thr Asp Pro Asp Thr Asp Ala Ala Thr He Gin Glu Val 

900 905 910 

Ser Glu Ala Val Leu Ala Leu Trp Leu Pro Thr Asp Ser Ala Thr Val 



915 



920 



925 



Leu Gin Lys Met Asn Glu He Gin Ala lie Ala Ala Arg Leu Pro Asn 



930 



935 



940 



Val Asp Leu Val Leu Ser Gin Thr Lys Gin Asp He Ala Arg Ala Arg 



945 



950 



955 



960 



Arg Leu Gin Ala Glu Ala Glu Glu Ala Arg Ser Arg Ala His Ala Val 



965 



970 



975 



Glu Gly Gin Val Glu Asp Val Val Gly Asn Leu Arg Gin Gly Thr Val 



980 



985 



990 



Ala Leu Gin Glu Ala Gin Asp Thr Met Gin Gly Thr Ser Arg Ser Leu 
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1010 1015 1020 

Ala Glu Lys Leu Val Thr Ser Met Thr Lys Gin Leu Gly Asp Phe Trp 
1025 1030 1035 1040 

Thr Arg Met Glu Glu Leu Arg His Gin Ala Arg Gin Gin Gly Ala Glu 

1045 1050 1055 

Ala Val Gin Ala Gin Gin Leu Ala Glu Gly Ala Ser Glu Gin Ala Leu 

1060 1065 1070 

Ser Ala Gin Glu Gly Phe Glu Arg lie Lys Gin Lys Tyr Ala Glu Leu 

1075 1080 1085 

Lys Asp Arg Leu Gly Gin Ser Sey Met Leu Gly Glu Gin Gly Ala Arg 

1090 1095 1100 

lie Gin Ser Val Lys Thr Glu Ala Glu Glu Leu Phe Gly Glu Thr Met 
1105 1110 1115 1120 

Glu Met Met Asp Arg Met Lys Asp Met Glu Leu Glu Leu Leu Arg Gly 

1125 1130 1135 

Ser Gin Ala lie Met Leu Arg Ser Ala Asp Leu Thr Gly Leu Glu Lys 

1140 1145 1150 

Arg Val Glu Gin lie Arg Asp His He Asn Gly Arg Val Leu Tyr Tyr 

1155 1160 1165 

Ala Thr Cys Lys 
1170 

Sequence No. : 8 
Sequence length: 122 
Sequence type: Amino acid 
Topology: Linear 
Sequence kind: Protein 



1^1 lltl 



K lili C. t 
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Organism species: Homo sapiens 
Cell kind: Stomach cancer 
Clone name: HP10298 
Sequence description 

Met Gly Leu Leu Leu Leu Val Pro Leu Leu Leu Leu Pro Gly Ser Tyr 

1 5 10 15 

Gly Leu Pro Phe Tyr Asn Gly Phe Tyr Tyr Ser Asn Ser Ala Asn Asp 

20 25 30 

Gin Asn Leu Gly Asn Gly His Gly Lys Asp Leu Leu Asn Gly Val Lys 
35 AO 45 

. . - _ _ . _ _ -■ — * mm* * rvT_ _ IPl> — Ah Tin Tm* T*V» «• 

Leu Vai Val uiu rnr vro i*iu biu i^ii i^eu me mi ^ & ±^ *^ 

50 55 60 

Val Gly Pro Gin Ser Leu Gly Ser Glu Ala Leu Ala Ser Pro Thr Arg 
65 70 75 80 

Arg Ala Ala Cys Thr Val Phe Thr Ala Thr Ala Ser Thr Arg Thr Trp 

85 90 95 

Gly Pro Pro Leu Pro His Ser Leu Thr Gly Cys Val Phe He Glu Trp 

100 105 HO 

Phe Val Phe Pro Cys Gly Leu Glu Pro Phe 
115 120 



Sequence No. : 9 
Sequence length: 175 
Sequence type: Amino acid 
Topology: Linear 
Sequence kind: Protein 
Hypothetical: No 
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Cell kind: Stomach cancer 
Clone name: HP10368 
Sequence description 

Met Glu Lys lie Pro Val Ser Ala Phe Leu Leu Leu Val Ala Leu Ser 

1 5 10 15 

Tyr Thr Leu Ala Arg Asp Thr Thr Val Lys Pro Gly Ala Lys Lys Asp 

70 25 30 

Thr Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser Arg Gly Trp 

35 40 45 

Gly Asp Gin Leu He Trp Thr Gin' Thr Tyr Glu Glu Ala Leu Tyr Lys 

50 55 60 

Ser Lys Thr Ser Asn Lys Pro Leu Met He He His His Leu Asp Glu 
65 70 75 80 

Cys Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu Asn Lys Glu 

85 90 95 

He Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu Val Tyr Glu 

100 105 110 

Thr Thr Asp Lys His Leu Ser Pro Asp Gly Gin Tyr Val Pro Arg He 

115 120 125 

Met Phe Val Asp Pro Ser Leu Thr Val Arg Ala Asp He Thr Gly Arg 

130 135 140 

Tyr Ser Asn Arg Leu Tyr Ala Tyr Glu Pro Ala Asp Thr Ala Leu Leu 
145 150 155 160 

Leu Asp Asn Met Lys Lys Ala Leu Lys Leu Leu Lys Thr Glu Leu 

165 170 175 



Sequence No . : 10 
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Strandedness : Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 
Original source: 

Organism species: Homo sapiens 

Cell kind: Fibrosarcoma 

Cell line: HT-1080 

Clone name: HP00658 
Sequence description 
ATGAAGGTCT CCGCGGCAGC 
GCATCTGCCT CCCCATATTC 
CCACTGCCCC GTGCCCACAT 
GCAGTCGTCC ACAGGTCAAG 
CTGTATGACT CCCGGCTGAA 
TGCCAACCCA GAGAAGAAAT 
TGGAGAGTCC TTGAACCTGA 
GCTTGGGAGG CTTCCCCTCA 

Sequence No. : 11 
Sequence length: 945 
Sequence type: Nucleic acid 

Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 
Original source: 

Organism species: Bomo sspiens 



CCTCGCTGTC ATCCTCATTG CTACTGCCCT CTGCGCTCCT 60 

CTCGCACACC ACACCCTGCT GCTTTGCCTA CATTGCCCGC 120 

CAAGGAGTAT TTCTACACCA GTGGCAAGTG CTCCAACCCA 180 

GATGCCAAAG AGAGAGGGAC AGCAAGTCTG GCAGGATTTC 240 

CAAGGGCAAG CTTTGTCACC CGAAAGAACC GCCAAGTGTG 300 

GGGTTCGGGA GTACATCAAC TCTTTGGAGA TGAGCTAGGA 360 

ACTTACACAA ATTTGCCTGT TTCTGCTTGC TCTTGTCCTA 420 

CTATCCTACC CCACCCGCTC CT 462 
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Clone name: HP00714 
Sequence description 

ATGGACCTGC GACAGTTTCT TATGTGCCTG TCCCTGTGCA 
CCCACAGAAA AGAAGGACCG TGTACATCAT GAGCCTCAGC 
GATGCTCAGA GTTTTGATTA TGACCATGAT GCCTTCTTGG 
TTTGATCAGC TGACACCAGA AGAGAGCAAG GAAAGGCTTG 
I'.rrr*^ Af^ACGGGTT TGTCACTGTG GATGAGCTCA 
CAAAAGCGCT GGATTTACGA GGATGTAGAG CGACAGTGGA 
GACGGCCTCG TTTCCTGGGA GG AG TAT AAA AATGCCACCT 
CCAGATCCTG ATGATGGATT TAACTATAAA CAGATGATGG 
AAAATGGCAG ACAAGGATGG AGACCTCATT GCCACCAAGG 
CACCCTGAGG AGTATGACTA CATGAAAGAT ATAGTAGTAC 
GATAAGAATG CTGATGGTTT CATTGATCTA GAAGAGTATA 
GATGGGAATA CTGATGAGCC AGAATGGGTA AAGACAGAGC 
CGGGATAAGA ACCGTGATGG GAAGATGGAC AAGGAAGAGA 
TCAGACTATG ATCATGCAGA GGCAGAAGCC AGGCACCTGG 
AAGGATGGCA AGCTTACCAA GGAGGAGATC GTTGACAAGT 
CAGGCCACAG ATTTTGGGGA GGCCTTAGTA CGGCATGATG 

Sequence No,: 12 

Sequence length: 474 

Sequence type: Nucleic acid 

Strandedness : Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Homo sapiens 

-„ on*' name . i:m)uo / i 



CAGCCTTTGC CTTGAGCAAA 60 

TCAGTGACAA GGTTCACAAT 120 

GTGCTGAAGA AGCAAAGACC 180 

GAAAGATTGT AAGTAAAATA 240 

AAGACTGGAT TAAATTTGCA 300 

AGGGGCATGA CCTCAATGAG 360 

ACGGCTACGT TTTAGATGAT 420 

TTAGAGATGA GCGGAGGTTT 480 

AGGAGTTCAC AGUTTTCCTG 540 

AGGAAACAAT GGAAGATATA 600 

TTGGTGACAT GTACAGCCAT 660 

GAGAGCAGTT TGTTGAGTTT 720 

CCAAAGACTG GATCCTTCCC 780 

TCTATGAATC AGACCAAAAC 840 

ATGACTTATT TGTTGGCAGC 900 

AGTTC 945 
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Sequence description 

ATGGCTTCCA GAAGGATGCG GCTGCTCCTA TTGCTGAGCT GCCTGGCCAA AACAGGAGTC 
CTGGGTGATA TCATCATGAG ACCCAGCTC, GCCCTGGA, «™» CAAGTCCAAT 
TGCTATGGTT ACTTCAGGAA MOUC TGCTCTGATG CCCAGCTCOA GTGTCAGTCT 

tacggaaacg gaggggagc, ggcatctatc cgagtitaa agoaagccag caccatagca 

rrrATATGGA TTGGCCTGCA CGACCCACAG 
GAGTACATAA GTGGCTATCA GAGAAGCCAG CCGATATGGA 

^r>cr. GATTGATGGG GCCATGTATC TGTACAGATC CTGGTCTGGC 
AAGTCCATGG GTGGGAACAA GCAGTGTGG, GAGATGAGCT CCAATAACAA CTTTTTAACT «» 

^aggagca acgaatgcaa caagggcoaa cacttcctg, gcaagtaccg acga 



60 
120 
180 
240 
300 
360 



Sequence No. : 13 
Sequence length: 1128 
Sequence type: Nucleic acid 
Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 
Original source: 

Organism species: Homo sapi 
Cell kind: Liver 
Clone name: HP01134 



Sequence description 








60 


ATGGTTTGGA AAGTAGCTGT 


ATTCCTCAGT 


GTGGCCCTGG 


GCATTGGTGC CGTTCCTATA 


GATGATCCTG AAGATGGAGG 


CAAGCACTGG 


GTGGTGATCG 


TGGCAGGTTC AAATGGCTGG 


120 


TATAATTATA GGCACCAGGC 


AGACGCGTGC 


CATGCCTACC 


AGATCATTCA CCGCAATGGG 


180 


ATTCCTGACG AACAGATCGT 


TGTGATGATG 


TACGATGACA 


TTGCTTACTC TGAAGACAAT 


240 


CCCACTCCAG GAATTGTGAT 


CAACAGGCCC 


AATGGCACAG 


ATGTCTATCA GGGAGTCCCG 


300 


AAGGACTACA CTGGAGAGGA 


TGTTACCCCA 


CAAAATTTCC 


TTGCTGTGTT GAGAGGCGAT 


360 


■"T^fTfATTT A ; : T T C AC T G A 






. :-Yt '.. 
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CTTCATGTAA 


AGGACCTdAA 


TGAGACCATC 


CATTACATGT 


ACAAACACAA 


AATGTACCGA 


540 


AAGATGGTGT 


TCTACATTGA 


AGCCTGTGAG 


TCTGGGTCCA 


TGATGAACCA 


CCTGCCGGAT 


600 


AACATCAATG 


TTTATGCAAC 


TACTGCTGCC 


AACCCCAGAG 


AGTCGTCCTA 


CGCCTGTTAC 


660 


TATGATGAGA 


AGAGGTCCAC 


GTACCTGGGG 


GACTGGTACA 


GCGTCAACTG 


GATGGAAGAC 


720 


TCGGACGTGG 


AAGATCTGAC 


TAAAGAGACC 


CTGCACAAGC 


AGTACCACCT 


GGTAAAATCG 


780 


CACACCAACA 


CCAGCCACGT 


CATGCAGTAT 


GGAAACAAAA 


CAATCTCCAC 


CATGAAAGTG 


840 


ATRCAGTTTC 


AGGGTATGAA 


ACGCAAAGCC 


AGTTCTCCCG 


TCCCCCTACC 


TCCAGTCACA 


900 


CACCTTGACC 


TCACCCCCAG 


CCCTGATGTG 


CCTCTCACCA 


TCATGAAAAG 


GAAACTGATG 


960 


AACACCAATG 


ATCTGGAGGA 


GTCCAGGCAG 


CTCACGGAGG 


AGATCCAGCG 


GCATCTGGAT 


1020 


TACGAGTATG 


CGTTGAGACA 


TTTGTACGTG 


CTGGTCAACC 


TTTGTGAGAA 


GCCGTATCCG 


1080 


CTTCACAGGA 


TAAAATTGTC 


CATGGACCAC 


GTGTGCCTTG 


GTCACTAC 




-i. u w 



Sequence No.: 14 

Sequence length: 519 

Sequence type: Nucleic acid 

Strandedness: Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Eomo sapiens 

Cell kind: Epidermoid carcinoma 

Cell line: KB 

Clone name: HP10029 
Sequence description 

ATGGCGGCGC CCAGCGGAGG GTGGAACGGC GTCCGCGCGA GCTTGTGGGC CGCGCTGCTC 60 
CTAGGGGCCG TGGCGCTGAG GCCGGCGGAG GCGGTGTCCG AGCCCACGAC CGTGGCGTTT 120 
GACGTGCGGC CCGGCGGCGT CGTGCATTCC TTCTCCCATA ACGTGGGCCC GGGGGACAAA 180 
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84 

.o^a mrsTCCGGG GCGCTGAGAT TGAGTACGCC 360 
TCCTATCTGT ACTTCACACA GTTCAAGGCA GAGGTGCGGfc 

ATGGCCTA CT CTAAAGCCGC ATTTGAAAGG GAAAGTGATG TCCCTCTGAA AACTGAGGAA 420 
TTTGAAGTGA CCAAAACAGC AGTGGCTCAC AGGCCCGGGC CATTCAAAGC TGAGCTGTCC A80 

519 

AAGCTGGTGA TTGTGGCCAA GGCATCGCGC ACTGAGCTG 



Sequence No. : 15 
Sequence length; 219 
Sequence type: Nucleic acid 
Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to rnKKA 
Original source: 

Organism species: Homo sapiens 

Cell kind: Epidermoid carcinoma 

Cell line: KB 

Clone name: HP10189 
Sequence description 

ATGGGGGTGA AGCTGGAGAT ATTTCGGATG ATAATCTACC TCACTTTCCC TGTGGCTATG 
TTCTGGGTTT CCAATCAGGC CGAGTGGTTT GAGGACGATG TCATACAGCG CAAGAGGGAG 120 
CTGTGGCCAC CTGAGAAGCT TCAAGAGATA GAGGAATTCA AAGAGAGGTT ACGGAAGCGG 180 
CGGGAGGAGA AGCTCCTTCG CGACGCCCAG CAGAACTCC 



60 



219 



Sequence No. : 16 
Sequence length: 3516 
Sequence type: Nucleic acid 
Strandedness: Double 
Topology: Linear 



WO 98/11217 




Organism specieg: Homo sapiens 

Cell kind: Lymphoma 

Cell line: U937 

Clone name: HP10269 
Sequence description 
ATGAGACCAT TCTTCCTCTT GTGTTTTGCC 
vrrrnr.r.GTG GGGCCTGCTA TCCACCTGTT 
CTCCGAGCTT CATCTACCTG TGGACTGACC 
GAGTGGCAGA TGAAATGCTG CAAGTGTGAC 
CGAGTAGAGA ATGTGGCTTC ATCCTCCGGC 
GTGAACCCTG TCTCTCTGCA GCTGGACCTG 
ATGGAGTTCC AGGGGCCCAT GCCTGCCGGC 
AAGACCTGGC GAGTGTACCA GTACCTGGCT 
CGCCAGGGTC GGCCTCAGAG CTGGCAGGAT 
AATGCACGCC TAAATGGGGG GAAGGTCCAA 
CCAGCAACTC AAAGTCAAAA AATTCAAGAG 
TTCACCAGGC TGGCCCCTGT GCCCCAAAGG 
GTGTCCCAGC TCCGTCTGCA GGGGAGCTGC 
CCCAAGCCTG GGGCCTCTGC AGGCCCCTCC 
TGCCAGCACA ACACTGCCGG CCCAAATTGT 
CCCTGGAGAC CGGCGGAGGG CCAGGACGCC 
CACTCAGAGA CATGTCACTT TGACCCCGCT 
GGTGTGTGTG ACAATTGCCG GGACCACACC 
CACTATTTCC GGAACCGGCG CCCGGGAGCT 
TGTGATCCGG ATGGGGCAGT GCCAGGGGCT 
TGCAAGGAGC ATGTGCAGGG AGAGCGCTGT 
ACCTACGCCA ACCCGCAGGG CTGCCACCGC 
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CTGCCTGGCC TCCTGCATGC CCAACAAGCC 60 

GGGGACCTGC TTGTTGGGAG GACCCGGTTT 120 

AAGCCTGAGA CCTACTGCAC CCAGTATGGC 180 

TCCAGGCAGC CTCACAACTA CTACAGTCAC 240 

CCCATGCGCT GGTGGCAGTC CCAGAATGAT 300 

GACAGGAGAT TCCAGCTTCA AGAAGTCATG 360 

ATGCTGATTG AGCGCTCCTC AGACTTCGGT 420 

GCCGACTGCA CCTCCACCTT CCCTCGGGTC 480 

GTTCGGTGCC AGTCCCTGCC TCAGAGGCCT 540 

CTTAACCTTA TGGATTTAGT GTCTGGGATT 600 

GTGGGGGAGA TCACAAACTT GAGAGTGAAT 660 

GGCTACCACC CTCCCAGCGC CTACTATGCT 720 

TTCTGTCACG GCCATGCTGA TCGCTGCGCA 780 

ACCGCTGTGC AGGTCCACGA TGTCTGTGTC 840 

GAGCGCTGTG CACCCTTCTA CAACAACCGG 900 

CATGAATGCC AAAGGTGCGA CTGCAATGGG 960 

GTGTTTGCCG CCAGCCAGGG GGCATATGGA 1020 

GAAGGCAAGA ACTGTGAGCG GTGTCAGCTG 1080 

TCCATTCAGG AGACCTGCAT CTCCTGCGAG 1140 

CCCTGTGACC CAGTGACCGG GCAGTGTGTG 1200 

GACCTATGCA AGCCGGGCTT CACTGGACTC 1260 

TGTGACTGCA ACATCCTGGG GTCCCGGAGG 1320 
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TGTGCCTGCG ACCCGCACAA CTCCCTCAGC CCACAGTGCA ACCAGTTCAC AGGGCAGTGC 1500 

CCCTGTCGGG AAGGCTTTGG TGGCCTGATG TGCAGCGCTG CAGCCATCCG CCAGTGTCCA 1560 

GACCGGACCT ATGGAGACGT GGCCACAGGA TGCCGAGCCT GTGACTGTGA TTTCCGGGGA. 1620 

ACAGAGGGCC CGGGCTGCGA GAAGGCATCA GGCCGCTGCC TCTGCCGCCC TGGCTTGACC 1680 

GGGCCCCGCT GTGACCAGTG CCAGCGAGGC TACTGCAATC GCTACCCGGT GTGCGTGGCC 1740 

TGCCACCCTT GCTTCCAGAC CTATGATGCG GACCTCCGGG AGCAGGCCCT GCGCTTTGGT 1800 

inArvrrrzr.A ATGCCACCGC CAGCCTGTGG TCAGGGCCTG GGCTGGAGGA CCGTGGCCTG 1860 

GCCTCCCGGA TCCTAGATGC AAAGAGTAAG ATTGAGCAGA TCCGAGCAGT TCTCAGCAGC 1920 

CCCGCAGTCA CAGAGCAGGA GGTGGCTCAG GTGGCCAGTG CCATCCTCTC CCTCAGGCGA 1980 

ACTCTCCAGG GCCTGCAGCT GGATCTGCCC CTGGAGGAGG AGACGTTGTC CCTTCCGAGA 2040 

GACCTGGAGA GTCTTGACAG AAGCTTCAAT GGTCTCCTTA CTATGTATCA GAGGAAGAGG 210 0 

GAGCAGTTTG AAAAAATAAG CAGTGCTGAT CCTTCAGGAG CCTTCCGGAT GCTGAGCACA 2160 

GCCTACGAGC AGTCAGCCCA GGCTGCTCAG CAGGTCTCCG ACAGCTCGCG CCTTTTGGAC 2220 

CAGCTCAGGG ACAGCCGGAG AGAGGCAGAG AGGCTGGTGC GGCAGGCGGG AGGAGGAGGA 2280 

GGCACCGGCA GCCCCAAGCT TGTGGCCCTG AGGCTGGAGA TGTCTTCGTT GCCTGACCTG 2340 

ACACCCACCT TGAACAAGCT CTGTGGCAAC TCCAGGCAGA TGGCTTGCAC CCCAATATGA 2400 

TGCCCTGGTG AGCTATGTCC CCAAGACAAT GGCACAGCCT GTGGCTCCCG CTGCAGGGGT 2460 

GTCCTTCCCA GGGCCGGTGG GGCCTTCTTG ATGGCGGGGC AGGTGGCTGA GCAGCTGCGG 2520 

GGCTTCAATG CCCAGCTCCA GCGGACCAGG CAGATGATTA GGGCAGCCGA GGAATCTGCC 2580 

TCACAGATTC AATCCAGTGC CCAGCGCTTG GAGACCCAGG TGAGCGCCAG CCGCTCCCAG 2640 

ATGGAGGAAG ATGTCAGACG CACACGGCTC CTAATCCAGC AGGTCCGGGA CTTCCTAACA 2700 

GACCCCGACA CTGATGCAGC CACTATCCAG GAGGTCAGCG AGGCCGTGCT GGCCCTGTGG 2760 

CTGCCCACAG ACTCAGCTAC TGTTCTGCAG AAGATGAATG AGATCCAGGC CATTGCAGCC 2820 

AGGCTCCCCA ACGTGGACTT GGTGCTGTCC CAGACCAAGC AGGACATTGC GCGTGCCCGC 2880 

CGGTTGCAGG CTGAGGCTGA GGAAGCCAGG AGCCGAGCCC ATGCAGTGGA GGG CCAGGTG 2940 

GAAGATGTGG TTGGGAACCT GCGGCAGGGG ACAGTGGCAC TGCAGGAAGC TCAGGACACC 3000 

ATGCAAGGCA CCAGCCGCTC CCTTCGGCTT ATCCAGGACA GGGTTGCTGA GGTTCAGCAG 3060 
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CAGCAGCTTG CGGAAGGTCC CAGCGAGCAG GCATTGAGTG CCCAAGAGGG ATTTGAGAGA 3240 

ATAAAACAAA AGTATGCTGA GTTGAAGGAC CGGTTGGGTC AGAGTTCCAT GCTGGGTGAG 3300 

CAGGGTGCCC GGATCCAGAG TGTGAAGACA GAGGCAGAGG AGCTGTTTGG GGAGACCATG 3360 

GAGATGATGG ACAGGATGAA AGACATGGAG TTGGAGCTGC TGCGGGGCAG CCAGGCCATC 3420 

ATGCTGCGCT CAGCGGACCT GACAGGACTG GAGAAGCGTG TGGAGCAGAT CCGTGACCAC 3480 

ATCAATGGGC GCGTGCTCTA CTATGCCACC TGCAAG 3516 



Sequence No - : 17 

Sequence length: 366 

Sequence type: Nucleic acid 

Strandedness : Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Eomo sapiens 
Cell kind: Stomach cancer 
Clone name: HP10298 



Sequence description 
ATGGGCCTGT TGCTCCTGGT 


CCCATTGCTC 


CTGCTGCCCG 


GCTCCTACGG 


ACTGCCCTTC 


60 


TACAACGGCT 


TCTACTACTC 


CAACAGCGCC 


AACGACCAGA ACCTAGGCAA 


CGGTCATGGC 


120 


AAAGACCTCC 


TTAATGGAGT 


GAAGCTGGTG 


GTGGAGACAC 


CCGAGGAGAC 


CCTGTTCACC 


180 


CGCATCCTAA 


CTGTGGGCCC 


CCAGAGCCTG 


GGGTCCGAAG 


CTTTGGCTTC 


CCCGACCCGC 


240 


AGAGCCGCTT 


GTACGGTGTT 


TACTGCTACC 


GCCAGCACTA 


GGACCTGGGG 


CCCTCCCCTG 


300 


CCGCATTCCC 
CCATTT 


TCACTGGCTG 


TGTATTTATT 


GAGTGGTTCG 


TTTTCCCTTG 


TGGGTTGGAG 


360 
366 



Sequence No . : 18 
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Strandedness: Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Homo sapiens 

Cell kind: Stomach cancer 

Clone name: HP10368 
Sequence description 

ATGGAGAAAA TTCCAGTGTC AGCATTCTTG CTCCTTGTGG CCCTCTCCTA CACTCTGGCC 60 
AGAGATACCA CAGTCAAACC TGGAGCCAAA AAGGACACAA AGGACTCTCG ACCCAAACTG 120 

^n*r r* a ArTr.ATCT GGACTCAGAC ATATGAAGAA 180 

CCCCAGACCC TCTULAWUa* iii**wxv«* 

GCTCTATATA AATCCAAGAC AAGCAACAAA CCCTTGATGA TTATTCATCA CTTGGATGAG 240 

TGCCCACACA GTCAAGCTTT AAAGAAAGTG TTTGCTGAAA ATAAAGAAAT CCAGAAATTG 300 

GCAGAGCAGT TTGTCCTCCT CAATCTGGTT TATGAAACAA CTGACAAACA CCTTTCTCCT 360 

GATGGCCAGT ATGTCCCCAG GATTATGTTT GTTGACCCAT CTCTGACAGT TAGAGCCGAT 420 

ATCACTGGAA GATATTCAAA CCGTCTCTAT GCTTACGAAC CTGCAGATAC AGCTCTGTTG 480 
CTTGACAACA TGAAGAAAGC TCTCAAGTTG CTGAAGACTG AATTG 



525 



Sequence No. : 19 

Sequence length: 1296 

Sequence type: Nucleic acid 

Strandedness: Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Homo sapiens 
Cell kind: Fibrosarcoma 
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Sequence characteristics: 

Code representing characteristics: CDS 

Existence site: 56.. 520 

Characterization method: E 
Sequence description 

CCTGCAGAGG ATCAAGACAG CACGTGGACC TCGCACAGCC TCTCCCACAG GTACC ATG 

Met 



MG GTC TCC GCG GCA GCC CTC GOT GTC ATC CTC ATT GCT ACT GCC CTC 
Lys Val Ser Ala Ala Ala Leu Ala Val He Leu lie Ala Thr Ala Leu 

5 ' 10 " 

TGC GCT CCT GCA TCT GCC TCC CCA TAT TCC TCG GAC ACC ACA CCC TGC 
Cys Ala Pro Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thr Thr Pro Cys 

TGC TTT GCC TAG ATT GCC CGC CCA CTG CCC CGT GCC CAC ATC AAG GAG 
Cys Phe Ala Tyr lie Ala Arg Pro Leu Pro Arg Ala His He Lys Glu 
35 

TAT TTC TAC ACC AGT GGC AAG TGC TCC AAC CCA GCA GTC GTC CAC AGG 
Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala Val Val His Arg 

TCA AGG ATG CCA AAG AGA GAG GGA CAG CAA GTC TGG CAG GAT TTC CTG 
Ser Arg Met Pro Lys Arg Glu Gly Gin Gin Val Trp Gin Asp Phe Leu 

7S 80 
70 ,:> 

TAT GAC TCC CGG CTG AAC AAG GGC AAG CTT TGT CAC CCG AAA GAA CCG 
Tyr Asp Ser Arg Leu Asn Lys Gly Lys Leu Cys His Pro Lys Glu Pro 

85 90 95 

CCA AGT GTG TGC CAA CCC AGA GAA GAA ATG GGT TCG GGA GTA CAT CAA 



58 



106 



154 



202 



250 



298 



346 



39A 
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CTC TTT GGA GAT GAG CTA GG^ TGG AGA GTC CTT GAA CCT GAA CTT ACA 442 
Leu Phe Gly Asp Glu Leu Gly Trp Arg Val Leu Glu Pro Glu Leu Thr 

115 120 125 

CAA ATT TGC CTG TTT CTG CTT GCT CTT GTC CTA GCT TGG GAG GCT TCC 490 
Gin He Cys Leu Phe Leu Leu Ala Leu Val Leu Ala Trp Glu Ala Ser 
130 135 140 145 

rrr C.AC. TAT CCT ACC CCA CCC GCT CCT TGAAGGGCCC AGA 530 
Pro His Tyr Pro Thr Pro Pro Ala Pro 

150 

TTCTACCACA CAGCAGCAGT TACAAAAACC TTCCCCAGGC TGGACGTGGT GGCTCACGCC 590 

TGTAATCCCA GCACTTTGGG AGGCCAAGGT GS^TGGATCA CTTGAGGTCA GGAGTTCGAG 650 

ACCAGCCTGG CCAACATGAT GAAACCCCAT CTCTACTAAA AATACAAAAA ATTAGCCGGG 710 

CGTGGTAGCG GGCGCCTGTA GTCCCAGCTA CTCGGGAGGC TGAGGCAGGA GAATGGCGTG 770 

AACCCGGGAG GCGGAGCTTG CAGTGAGCCG AGATCGCGCC ACTGCACTCC AGCCTGGGCG 830 

ACAGAGCGAG ACTCCGTCTC AAAAAAAAAA AAAAAAAAAA AAATACAAAA ATTAGCCGGG 890 

CGTGGTGGCC CACGCCTGTA ATCCCAGCTA CTCGGGAGGC TAAGGCAGGA AAATTGTTTG 950 

AACCCAGGAG GTGGAGGCTG CAGTGAGCTG AGATTGTGCC ACTTCACTCC AGCCTGGGTG 1010 

ACAAAGTGAG ACTCCGTCAC AACAACAACA ACAAAAAGCT TCCCCAACTA AAGCCTAGAA 1070 

GAGCTTCTGA GGCGCTGCTT TGTCAAAAGG AAGTCTCTAG GTTCTGAGCT CTGGCTTTGC 1130 

CTTGGCTTTG CCAGGGCTCT GTGACCAGGA AGGAAGTCAG CATGCCTCTA GAGGCAAGGA 1190 

GGGGAGG AAC GCTGCACTCT TAAGCTTCCG CCGTCTCAAC CCCTCACAGG AGCTTACTGG 1250 

CAAACATGAA AAATCGGCTT ACCATTAAAG TTCTCAATGC AAC CAT 1296 



Sequence No . : 20 
Sequence length: 3311 
Sequence type: Nucleic acid 
Strandedness : Double 



t'u ueiK (■ muu , - ft . - t. . 
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Original source: ^ 

Organism species: Bomo sapiens 
Ceil kind: Epidermoid carcinoma 
Cell line: KB 
Clone name: HP00714 
Sequence characteristics: 
r>~A„ ronrpqpntine characteristics: CDS 
Existence site: 57.. 1004 
Characterization method: E 
Sequence description ( 

GAGCGGCGGC CACGGCATCC TGTGCTGT&G GGGCTACGAG GAAAGATCTA ATTATC ATG 

Met 

GAC CTG CGA CAG TTT 
Asp Leu Arg Gin Phe 

5 

TTG AGC AAA CCC ACA 
Leu Ser Lys Pro Thr 
20 

CTC AGT GAC AAG GTT 
Leu Ser Asp Lys Val 
35 

GAT GCC TTC TTG GGT 
Asp Ala Phe Leu Gly 
50 

CCA GAA GAG AGC AAG 
Pro Glu Glu Ser Lys 



CTT ATG TGC CTG TCC CTG TGC ACA GCC TTT GCC 



107 



Leu Met Cys Leu Ser Leu Cys Thr Ala Phe Ala 



10 



15 



GAA AAG AAG GAC CGT GTA CAT CAT GAG CCT CAG 



155 



Glu Lys Lys Asp Arg Val His His Glu Pro Gin 



25 



30 



CAC AAT GAT GCT CAG AGT TTT GAT TAT GAC CAT 



203 



His Asn Asp Ala Gin Ser Phe Asp Tyr Asp His 



40 



45 



GCT GAA GAA GCA AAG ACC TTT GAT CAG CTG ACA 



251 



Ala Glu Glu Ala Lys Thr Phe Asp Gin Leu Thr 



55 



60 



65 



GAA AGG CTT GGA AAG ATT GTA AGT AAA ATA GAT 



299 



Glu Arg Leu Gly Lys lie Val Ser Lys He Asp 



■ t\t\ ' v 
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Gly Asp Lys Asp Gly F\he Val Thr Val Asp Glu Leu Lys Asp Trp lie 

85 90 95 

AAA TTT GCA CAA AAG CGC TGG ATT TAC GAG GAT GTA GAG CGA CAG TGG 395 
Lys Phe Ala Gin Lys Arg Trp He Tyr Glu Asp Val Glu Arg Gin Trp 

100 105 HO 

AAG GGG CAT GAC CTC AAT GAG GAC GGC CTC GTT TCC TGG GAG GAG TAT 443 
Lys Gly His Asp Leu Asn Glu Asp Gly Leu Val Ser Trp Glu Glu Tyr 



115 



120 



125 



AAA AAT GCC ACC TAC GGC TAC GTT TTA GAT GAT CCA GAT CCT GAT GAT 

Lys Asn Ala Thr Tyr Gly Tyr Val Leu Asp Asp Pro Asp Pro Asp Asp 

130 135 ( 140 145 

GGA TTT AAC TAT AAA CAG ATG ATCT'gTT AGA GAT GAG CGG AGG TTT AAA 

Gly Phe Asn Tyr Lys Gin Met Met Val Arg Asp Glu Arg Arg Phe Lys 



150 



155 



160 



ATG GCA GAC AAG GAT GGA GAC CTC ATT GCC ACC AAG GAG GAG TTC ACA 
Met Ala Asp Lys Asp Gly Asp Leu He Ala Thr Lys Glu Glu Phe Thr 



165 



170 



175 



GCT TTC CTG CAC CCT GAG GAG TAT GAC TAC ATG AAA GAT ATA GTA GTA 
Ala Phe Leu His Pro Glu Glu Tyr Asp Tyr Met Lys Asp He Val Val 



180 



185 



190 



CAG GAA ACA ATG GAA GAT ATA GAT AAG AAT GCT GAT GGT TTC ATT GAT 

Gin Glu Thr Met Glu Asp He Asp Lys Asn Ala Asp Gly Phe He Asp 



491 



539 



587 



635 



683 



195 



200 



205 



CTA GAA GAG TAT ATT GGT GAC ATG TAC AGC CAT GAT GGG AAT ACT GAT 
Leu Glu Glu Tyr He Gly Asp Met Tyr Ser His Asp Gly Asn Thr Asp 



731 



210 



215 



220 



225 



GAG CCA GAA TGG GTA AAG ACA GAG CGA GAG CAG TTT GTT GAG TTT CGG 



779 



• 
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GAT AAG AAC CGT GAT GGG AAG ATG GAC AAG GAA GAG ACC AAA GAC TGG 
Asp Lys Asn Arg Asp Gly Lys Met Asp Lys Glu Clu Thr Lys Asp Trp 



245 



250 



255 



827 



875 



923 



971 



1020 



ATC CTT CCC TCA GAC TAT GAT CAT GCA GAG GCA GAA GCC AGG CAC CTG 
He Leu Pro Ser Asp Tyr Asp His Ala Glu Ala Glu Ala Arg His Leu 

260 265 
CTC TAT GAA TCA GAC CAA AAC AAG GAT GGC AAG CTT ACC AAG GAG GAG 
Val Tyr Glu Ser Asp Gin Asn Lys Asp Gly Lys Leu Thr Lys Glu Glu 

275 2 80 285 

ATC GTT GAC AAG TAT GAC TTA TTT GTT GGC AGC CAG GCC ACA GAT TTT 
He Val Asp Lys Tyr Asp Leu Phe Val Gly Ser Gin Ala Thr Asp Fhe 

295 300 305 

290 Z3:> 

GGG GAG GCC TTA GTA CGG CAT GAT GAG TTC TGAGCTACGG AGGAACCCT 
Gly Glu Ala Leu Val Arg His Asp Glu Phe 

310 315 

CATTTCCTCA AAAGTAATTT ATTTTTACAG CTTCTGGTTT CACATGAAAT TGTTTGCGCT 1080 

ACTGAGACTG TTACTACAAA CTTTTTAAGA CATGAAAAGG CGTAATGAAA ACCATCCCGT 11*0 

CCCCATTCCT CCTCCTCTCT GAGGGACTGG AGGGAAGCCG TGCTTCTGAG GAACAACTCT 1200 

AATTAGTACA CTTGTGTTTG TAGATTTACA CTTTGTATTA TGTATTAACA TGGCGTGTTT 1260 

ATTTTTGTAT TTTTCTCTGG TTGGGAGTAT GATATGAAGG ATCAAGATCC TCAACTCACA 1320 

CATGTAGACA AACATTAGCT CTTTACTCTT TCTCAACCCC TTTTATGATT TTAATAATTC 1380 

TCACTTAACT AATTTTGTAA GCCTGAGATC AATAAGAAAT GTTCAGGAGA GAGGAAAGAA 1*40 

AAAAAATATA TGCTCCACAA TTTATATTTA GAGAGAGAAC ACTTAGTCTT GCCTGTCAAA 1500 

AAGTCCAACA TTTCATAGGT AGTAGGGGCC ACATATTACA TTCAGTTGCT ATAGGTCCAG 1560 

CAACTGAACC TGCCATTACC TGGGCAAGGA AAGATCCCTT TGCTCTAGGA AAGCTTGGCC 1620 

CAAATTGATT TTCTTCTTTT TCCCCCTGTA GGACTGACTG TTGGCTAATT TTGTCAAGCA 1680 

CAGCTGTGGT GGGAAGAGTT AGGGCCAGTG TCTTGAAAAT CAATCAAGTA GTGAATGTGA 17*0 
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TGCCTTTTGA AATCACTG^TA AATGCCCCCA TCCGGTTCCT CTTCTTCCCA GGTGTGCCAA 1920 

\ 

GGAATTAATC TTGGTTTCA^ TACAATTAAA ATTCACTCCT TTCCAATCAT GTCATTGAAA 1980 

GTGCCTTTAA CG AAAGAAAT GG TCACTGAA TGGGAATTCT CTTAAGAAAC CCTGAGATTA 2040 

AAAAAAGACT ATTTGGATAA CTTATAGGAA AGCCTAGAAC CTCCCAGTAG AGTGGGGATT 2100 

TTTTTCTTCT TCCCTTTCTC TTTTGGACAA TAGTTAAATT AGCAGTATTA GTTATGAGTT 2160 

TGGTTGCAGT GTTCTTATCT TGTGGGCTGA TTTCCAAAAA CCACATGCTG CTGAATTTAC 2220 

CAGGGATCCT CATACCTCAC AATGCAAACC ACTTACTACC AGGCCTTTTT CTGTGTCCAC 2280 

TGGAGAGCTT GAGCTCACAC TCAAAGATCA GAGGACCTAC AGAGAGGGCT CTTTGGTTTG 2340 

AGGACCATGG CTTACCTTTC CTGCCTTTGA CCCATCACAC CCCATTTCCT CCTCTTTCCC 2400 

TCTCCCCGCT GCCAAAAAAA AAAAAMAAG GAAACGTTTA TCATGAATCA ACAGGGTTTC 2460 

AGTCCTTATC AAAGAGAGAT GTGGAAAGAG CTAAAbAAAC CACCCTTTGT TCCCAACTGC ?S70 

ACTTTACCCA TATTTTATGC AACACAAACA CTGTCCTTTT GGGTCCCTTT CTTACAGATG 2580 

GACCTCTTGA GAAGAATTAT CGTATTCCAC GTTTTTAGCC CTCAGGTTAC CAAGATAAAT 2640 

ATATGTATAT ATAACCTTTA TTATTGCTAT ATCTTTGTGG ATAATACATT CAGGTGGTGC 2700 

TGGGTGATTT ATTATAATCT GAACCTAGGT ATATCCTTTG GTCTTCCACA GTCATGTTGA 2760 

GGTGGGCTCC CTGGTATGGT AAAAAGCCAG GTATAATGTA ACTTCACCCC AGCCTTTGTA 2820 

CTAAGCTCTT GATAGTGGAT ATACTCTTTT AAGTTTAGCC CCAATATAGG GTAATGGAAA 2880 

TTTCCTGCCC TCTGGGTTCC CCATTTTTAC TATTAAGAAG ACCAGTGATA ATTTAATAAT 2940 

GCCACCAACT CTGGCTTAGT TAAGTGAGAG TGTGAACTGT GTGGCAAGAG AGCCTCACAC 3000 

CTCACTAGGT GCAGAGAGCC CAGGCCTTAT GTTAAAATCA TGCACTTGAA AAGCAAACCT 3060 

TAATCTGCAA AGACAGCAGC AAGCATTATA CGGTCATCTT GAATGATCCC TTTGAAATTT 3120 

TTTTTTTGTT TGTTTGTTTA AATCAAGCCT GAGGCTGGTG AACAGTAGCT ACACACCCAT 3180 

ATTGTGTGTT CTGTGAATGC TAGCTTTCTT GAATTTGGAT ATTGGTTATT TTTTATAGAG 3240 

TGTAAACCAA GTTTTATATT CTGCAATGCG AACAGGTACC TATCTGTTTC TAAATAAAAC 3300 

TGTTTACATT C 3311 

Sequence No . : 21 
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Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 
Original source: 

Organism species: Homo sapiens 

Cell kind: Stomach cancer 

nione name: HP00876 
Sequence characteristics: 
Code representing characteristics: CDS 

Existence site: 147.. 623 , 

. i 

Characterization method: E 
Sequence description 

ACTGGAGACA CTGAAGAAGG CAGGGGCCCT TAGAGTCTTG GTTGCCAAAC AGATTTGCAG 
ATCAAGGAGA ACCCAGGAGT TTCAAAGAAG CGCTAGTAAG GTCTCTGAGA TCCTTGCACT 
AGCTACATCC TCAGGGTAGG AGGAAG ATG GCT TCC AGA AGC ATG CGG CTG CTC 

Met Ala Ser Arg Ser Met Arg Leu Leu 
1 5 
CTA TTG CTG AGC TGC CTG GCC AAA ACA GGA GTC CTG GGT GAT ATC ATC 
L eu Leu Leu Ser Cys Leu Ala Lys Thr Gly Val Leu Gly Asp lie lie 

15 20 25 

10 15 

ATG AGA CCC AGC TGT GCT CCT GGA TGG TTT TAG CAC AAG TCC AAT TGC 
Met Arg Pro Ser Cys Ala Pro Gly Trp Phe Tyr His Lys Ser Asn Cys 

30 35 4° 

TAT GGT TAG TTC AGG AAG CTG AGG AAC TGG TCT GAT GCC GAG CTC GAG 
Tyr Gly Tyr Phe Arg Lys Leu Arg Asn Trp Ser Asp Ala Glu Leu Glu 

45 50 55 

TGT CAG TCT TAG GGA AAC GGA GCC CAC CTG GCA TCT ATC CTG AGT TTA 



60 
120 
173 



221 



269 



317 



365 



461 



509 
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AAG GAA GCC AGC ACC A?A GCA GAG TAG ATA AGT GGC TAT GAG AGA AGC 413 
Lys Glu Ala Ser Thr lie Ala Glu Tyr lie Ser Gly Tyr Gin Arg Ser 

75 80 85 

CAG CCG ATA TGG ATT GGC CTG CAC GAC CCA CAG AAG AGG CAG CAG TGG 
Gin Pro He Trp He Gly Leu His Asp Pro Gin Lys Arg Gin Gin Trp 
90 95 105 

CAG TGG ATT GAT GGG GCC ATG TAT CTG TAC AGA TCC TGG TCT GGC AAG 
Gin Trp He Asp Gly Ala Met Tyr Leu Tyr Arg ser up »I, 

110 H5 12° 

TCC ATG GGT GGG AAC AAG CAC TGT GCT GAG ATG AGC TCC AAT AAC AAC 

.... ~i_ m. .™ t„o mc r.»e. Ala Glu Met Ser Ser Asn Asn Asn 
ser rxeu oijf «ij ~ - - j - — 

125 130 135 

TTT TTA ACT TGG AGC AGC AAC GAA TGC AAC AAG CGC CAA CAC TTC CTG 
Phe Leu Thr Trp Ser Ser Asn Glu Cys Asn Lys Arg Gin His Phe Leu 

140 "5 I 50 

TGC AAG TAC CGA CCA TAGAGCAAGA ATCAAGATTC TGCTAACTCC 
Cys Lys Tyr Arg Pro 
155 



557 



605 



650 



TGCACAGCCC CGTCCTCTTC CTTTCTGCTA 


GCCTGGCTAA ATCTGCTCAT 


TATTTCAGAG 


710 


GGGAAACCTA GCAAACTAAG AGTGATAAGG 


GCCCTACTAC ACTGGCTTTT 


TTAGGCTTAG 


770 


AGACAGAAAC TTTAGCATTG GCC CAG TAG T 


GGCTTCTAGC TCTAAATGTT 


TGCCCCGCCA 


830 


TCCCTTTCCA CAGTATCCTT CTTCCCTCCT 


CCCCTGTCTC TGGCTGTCTC 


GAGCAGTCTA 


890 


GAAGAGTGCA TCTCCAGCCT ATGAAACAGC 


TGGGTCTTTG GCCATAAGAA 


GTAAAGATTT 


950 


GAAGACAGAA GGAAGAAACT CAGGAGTAAG 


CTTCTAGCCC CCTTCAGCTT 


CTACACCCTT 


1010 


CTGCCCTCTC TCCATTGCCT GCACCCCACC 


CCAGCCACTC AACTCCTGCT 


TGTTTTTCCT 


1070 


TTGGCCATGG GAAGGTTTAC CAGTAGAATC 


CTTGCTAGGT TGATGTGGGC 


CATACATTCC 


1130 


TTTAATAAAC CATTGTGTAC AT 






1152 



WO 98/11217 



PCT/JP97/03239 



97 



Sequence length: ^749 
Sequence type: Nucleic acid 
Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 

Original source: 

A-^icTn snecies: Homo sapiens 

Cell kind: Liver 
Clone name: HP01134 
Sequence characteristics: - 
Code representing characteristics: CDS 
Existence site: 117.. 1247 
Characterization method: E 
Sequence description 

AATCACAGCA GTNCCGACGT CGTGGGTGTT TGGTGTGAGG CTGCGAGCCG CCGCGGCCAC 
CACTGCCACC ACGGTCGCCT GCCACAGGTG TCTGCAATTG AACTCCAAGG TGCAGA ATG 119 

Met 



GTT TOO AAA GTA OCT GTA TTC CTC AGT GTG GCC CTG GGC ATT GGT GCC 
Val Trp Lys Val Ala Val Phe Leu Ser Val Ala Leu Gly He Gly Ala 



10 15 



Val Pro He Asp Asp Pro Glu Asp Gly Gly Lys His Trp Val Val He 

25 30 
20 ZD 

nrr Trr TAT AAT TAT AGG CAC CAG GCA GAC GCG 
GTG GCA GGT TCA AAT GGC TGG TAT AAl mi * 

m Trr, Tvr Asn Tvr Arg His Gin Ala Asp Ala 
Val Ala Gly Ser Asn Gly Trp Tyr Asn iyr ^ 

/n * 5 



60 



167 



215 



263 
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50 35 60 65 

\ 

ATC GTT GTG ATG ATG TAC^ GAT GAC ATT GCT TAG TCT GAA GAC AAT CCC 359 

\ 

lie Val Val Met Met Tyr Asp Asp He Ala Tyr Ser Glu Asp Asn Pro 

70 75 80 

ACT CCA GGA ATT GTG ATC AAC AGG CCC AAT GGC ACA GAT GTC TAT CAG 407 
Thr Pro Gly He Val He Asn Arg Pro Asn Gly Thr Asp Val Tyr Gin 

«S 90 95 

GGA GTC CCG AAG GAC TAC ACT GGA GAG GAT GTT ACC CCA CAA AAT TTC 455 
Gly Val Pro Lys Asp Tyr Thr Gly Glu Asp Val Thr Pro Gin Asn Phe 
100 105 HO 



tr n n 
JUJ 



CTT GCT GTG TTG AGA GGC GAT GCA GAA GCA GTG AAG GGC ATA GGA TCC 

/ 

L- v 

Leu Ala Val Leu Arg Gly Asp Ala Glu Ala Val Lys Gly He Gly Ser 

115 120 125 

GGC AAA GTC CTG AAG AGT GGC CCC CAG GAT CAC GTG TTC ATT TAC TTC 551 
Gly Lys Val Leu Lys Ser Gly Pro Gin Asp His Val Phe He Tyr Phe 
130 135 140 145 

ACT GAC CAT GGA TCT ACT GGA ATA CTG GTT TTT CCC AAT GAA GAT CTT 599 
Thr Asp His Gly Ser Thr Gly He Leu Val Phe Pro Asn Glu Asp Leu 

150 155 160 

CAT GTA AAG GAC CTG AAT GAG ACC ATC CAT TAC ATG TAC AAA CAC AAA 647 
His Val Lys Asp Leu Asn Glu Thr He His Tyr Met Tyr Lys His Lys 

165 170 175 

ATG TAC CGA AAG ATG GTG TTC TAC ATT GAA GCC TGT GAG TCT GGG TCC 695 
Met Tyr Arg Lys Met Val Phe Tyr He Glu Ala Cys Glu Ser Gly Ser 

180 185 190 

ATG ATG AAC CAC CTG CCG GAT AAC ATC AAT GTT TAT GCA ACT ACT GCT 743 
Met Met Asn His Leu Pro Asp Asn He Asn Val Tyr Ala Thr Thr Ala 



A A . 




839 



BB7 



935 



983 
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Ala Asn Pro Arg Glu Ser Ser Tyr Ala Cys Tyr Tyr Asp Glu Lys Arg 

210 215 220 2 " 

TCC ACG TAC CTG GGG GAC TGG TAC AGC GTC AAC TGG ATG GAA GAC TCG 
Ser Thr Tyr Leu Gly Asp Trp Tyr Ser Val Asn Trp Met Glu Asp Ser 

230 235 240 

GAC GTG GAA GAT CTG ACT AAA GAG ACC CTG CAC AAG CAG TAC CAC CTG 
Aso Val Glu Asp Leu Thr Lys Glu Thr Leu His Lys Gin Tyr His Leu 

245 250 255 

GTA AAA TCG CAC ACC AAC ACC AGC CAC GTC ATG CAG TAT GGA AAC AAA 
Val Lys Ser His Thr Asn Thr Ser His Val Met Gin Tyr Gly Asn Lys 

260 265' 270 

ACA ATC TCC ACC ATG AAA GTG ATG CAG TTT CAG GGT ATG AAA CGC AAA 
Thr lie Ser Thr Met Lys Val Met Gin Phe Gin Gly Met Lys Arg Lys 

275 280 285 

GCC AGT TCT CCC GTC CCC CTA CCT CCA GTC ACA CAC CTT GAC CTC ACC 
Ala Ser Ser Pro Val Pro Leu Pro Pro Val Thr His Leu Asp Leu Thr 
290 295 300 305 

CCC AGC CCT GAT GTG CCT CTC ACC ATC ATG AAA AGG AAA CTG ATG AAC 
Pro Ser Pro Asp Val Pro Leu Thr He Met Lys Arg Lys Leu Met Asn 

310 315 320 

ACC AAT GAT CTG GAG GAG TCC AGG CAG CTC ACG GAG GAG ATC CAG CGG 
Thr Asn Asp Leu Glu Glu Ser Arg Gin Leu Thr Glu Glu lie Gin Arg 

325 330 335 

CAT CTG GAT TAC GAG TAT GCG TTG AGA CAT TTG TAC GTG CTG GTC AAC 
His Leu Asp Tyr Glu Tyr Ala Leu Arg His Leu Tyr Val Leu Val Asn 

340 345 350 

CTT TGT GAG AAG CCG TAT CCG CTT CAC AGG ATA AAA TTG TCC ATG GAC 1223 



1031 



1079 



1127 



1175 
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Sequence No. : 23 

Sequence length: 988 

Sequence type: Nucleic acid 

Strandedness: Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Homo sapiens 
Cell kind: Epidermoid carcinoma 
Cell line: KB 
Clone name: HP10029 
Sequence characteristics: 
Code representing characteristics: CDS 
Existence site: 9.. 530 
Characterization method: E 



1270 



100 

CAC GTG TGC CTT GGT CAC TAG TGAAGAGCTG CCTCCTGGAA GCTTTT 
His Val Cys Leu Gly His Tyr 
370 375 

CCAAGTGTGA GCGCCCCACC GACTGTGTGC TGATCAGAGA CTGGAGAGGT GGAGTGAGAA 1330 
GTCTCCGCTG CTCGGGCCCT CCTGGGGAGC CCCCGCTCCA GGGCTCGCTC CAGGACCTTC 1390 
TTCACAAGAT GACTTGCTCG CTGTTACCTG CTTCCCCAGT CTTTTCTGAA AAACTACAAA 1450 
TTAGGGTGGG AAAAGCTCTG TATTGAGAAG GGTCATATTT GCTTTCTAGG AGGTTTGTTG 1510 
TTTTGCCTGT TAGTTTTGAG GAGCAGGAAG CTCATGGGGG CTXCTGXA", o^i—n — 
AGGAGTCTTT ATTCTGAGAA TTTGAAGCTG AAACCTCTTT AAATCTTCAG AATGATTTTA 1630 
TTGAAGAGGG CCGCAAGCCC CAAATGGAAA ACTGTTTTTA GAAAATATGA TGATTTTTGA 1690 
a AfiTfTTA A A AAATAAAGAT TTATAACAG 1749 

T TG C T XT XU 1 A'l I IAA HOI vjrwrvvj^ ±u±x~ — 
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Met Ala Ala Pro Ser Gly Cly Trp Asn Gly Val Arg Ala Ser 



10 



TTG TGG GCC GCG CTG CTC CTA GGG GCC GTG GCG CTG AGO CCG GCG GAG 
Leu Trp Ala Ala Leu Leu Leu Gly Ala Val Ala Leu Arg Pro Ala Glu 



15 



20 



25 



30 



GCG GTG TCC GAG CCC ACG ACC GTG GCG TTT GAC GTG CGG CCC GGC GGC 
vl Ser Glu Pro Thr Thr Val Ala Phe Asp Val Arg Pro Gly Gly 



35 



40 



A5 



GTC GTG CAT TCC TTC TCC CAT AAC GTG GGC CCG GGG GAC AAA TAT ACG 
Val Val His Ser Phe Ser His Asn Val Gly Pro Gly Asp Lys Tyr Thr 



50 



55 



60 



TGT ATG TTC ACT TAG GCC TCT CAA GGA GGG ACC AAT GAG CAA TGG GAG 
Cys Met Phe Thr Tyr Ala Ser Gin Gly Gly Thr Asn Glu Gin Trp Gin 



65 



70 



75 



ATG AGT CTG GGG ACC AGC GAA GAC CAC CAG CAC TTC ACC TGC ACC ATC 
Met Ser Leu Gly Thr Ser Glu Asp His Gin His Phe Thr Cys Thr lie 



80 



85 



90 



TGG AGG CCC CAG GGG AAG TCC TAT CTG TAG TTC ACA CAG TTC AAG GCA 
Trp Arg Pro Gin Gly Lys Ser Tyr Leu Tyr Phe Thr Gin Phe Lys Ala 



95 



100 



105 



110 



GAG GTG CGG GGC GCT GAG ATT GAG TAG GCC ATG GCC TAC TCT AAA GCC 
Glu Val Arg Gly Ala Glu lie Glu Tyr Ala Met Ala Tyr Ser Lys Ala 



115 



120 



125 



GCA TTT GAA AGG GAA AGT GAT GTC CCT CTG AAA ACT GAG GAA TTT GAA 
Ala Phe Glu Arg Glu Ser Asp Val Pro Leu Lys Thr Glu Glu Phe Glu 



130 



135 



140 



GTG ACC AAA ACA GCA GTG GCT CAC AGG CCC GGG GCA TTC AAA GCT GAG 



98 



146 



194 



242 



290 



338 



386 



434 



482 
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CTG TCC AAG CTG GTG ATT GTG GCC AAG GCA TCG CGC ACT GAG CTG 
Leu Ser Lys Leu Val He Val Ala Lys Ala Ser Arg Thr Glu Leu 



160 




165 




170 






TGA CCAGCAGCCC TGTTGCGGGT GGCACCTTCT CATCTCCGGT GAAGCTGAAG 


580 


GGGCCTGTGG 


CCCTGAAAGG 


GCCAGCACAT 


CACTGGTTTT 


CTAGGAGGGA 


CTCTTAAGTT 


640 


TTCTACCTGG 


GCTGACGTTG 


CCTTGTCCGG 


AGGGGCTTGC 


AGGGTGGCTG 


AAGCCCTGGG 


700 


GCAGAGAACA 


GAGGGTCCAG 


GGCCCTCCTG 


GCTCCCAACA 


GCTTCTCAGT 


TCCCACTTCC 


760 


TGCTGAGCTC 


TTCTGGACTC 


AGGATCGCAG 


ATCCGGGGCA 


CAAAGAGGGT 


GGGGAACATG 


820 


GGGG CTATGC 


TGGGGAAAGC 


AGCCATGCTC 


CCCCCGACCT 


CCAGCCGAGC 


ATCCTTCATG 


880 


AGCCTGCAGA 


ACTGCTTTCC 


TATGTTTACC 


CAGGGGACCT 


CCTTTCAGAT 


GAACTGGGAA 


940 


GAGATGAAAT 


GTTTTTTCAT 


ATTTAAATAA 


ATAAGAACAT 


TAAAAAGC 




OOP 



Sequence No.: 24 

Sequence length: 390 

Sequence type: Nucleic acid 

Strandedness : Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Homo sapiens 

Cell kind: Epidermoid carcinoma 

Cell line: KB 

Clone name: HP10189 
Sequence characteristics: 
Code representing characteristics: CDS 
Existence site: 102.. 323 
Characterization method: E 
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GGCTTTGGGC GGAACTGGCT TTGTTGACCG GGAGAAACGA G ATG GGG GTG AAG CTG 116 

Met Gly Val Lys Leu 



GAG ATA TTT CGG ATG ATA ATC TAC CTC ACT TTC CCT GTG GCT ATG TTC 164 
Glu He Phe Arg Met He He Tyr Leu Thr Phe Pro Val Ala Met Phe 

10 15 20 

TUG Gii i-o C-\? w t^C TTT GAG GAC GAT GTC ATA CAG CGC 212 

Trp Val Ser Asn Gin Ala Glu Trp Phe Glu Asp Asp Val He Gin Arg 

25 30 35 

AAG AGG GAG CTG TGG CCA CCT GAG AAG CTT CAA GAG ATA GAG GAA TTC 260 
Lys Arg Glu Leu Trp Pro Pro Glu Lys Leu Gin Glu He Glu Glu Phe 

40 %5 50 

AAA GAG AGG TTA CGG AAG CGG CGG GAG GAG AAG CTC CTT CGC GAC GCC 308 
Lys Glu Arg Leu Arg Lys Arg Arg Glu Glu Lys Leu Leu Arg Asp Ala 

55 60 65 

CAG CAG AAC TCC TGAGGCCTCC AAGTGGGAGT CCTAGCCCCT 350 
Gin Gin Asn Ser 
70 

CCCCTGATGA AATATACATA TACTCAGTTC CTTGTTATTC 390 



Sequence No . : 25 

Sequence length: 4667 

Sequence type: Nucleic acid 

Strandedness : Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 



1 T1 
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Cell line: U937 

Clone name: HP10269 
Sequence characteristics: 
Code representing characteristics: CDS 
Existence site : 754.. 4272 
Characterization method: E 



Sequence description 












CATTTAGTTA 


CTCTGCTCAT 


TTCTCTTAAG 


CTTTCCTTGG 


ATGAGTTGAG 


CTTTGAATCC 


60 


TTCCTGATGA 


ACCTTGCCTT 


TTAAGGATCC 


TCCAAATGCC 


CCAAGAAGCT 


GGGATTTTTC 


120 


ATTTTTTTTT 


TCACTGGGGA 


GGGGAATGGT 


GCTTTCCAGG 


GTCCTGGATG 


TTTGAGTCTT 


180 


CTCACCTTCC 


AGCCCGGTGA 


TATGTCTGGA 








m. n 

J_ -» v 


TGTGTTCTCT 


GCCTGATCTT 


CTGTCTGGGG 


TGGTCCAGGT 


CACAAGAAGA AGCTGACCCC 


300 


TGCTGGCTTT 


GGGAAAATGC 


TGAGTTCATT 


GCCTGGCACA 


AATGCAAGGG 


CCCTTCCCCA 


360 


CCCTGTGAAT 


TCTGGTCTCT 


GATGATCACT 


TACATGTGCC 


TTGTGCTTTC 


TGTTTGAGGG 


420 


GCCCCTTGCA 


GCCCCCACAG 


GCAGGTGGGC 


ATTGTGGAGC 


TC ACT AC AAG 


AACTCTGGGA 


480 


CCGACCGACC 


AACCCACTTG 


CCCAGTCCCG 


TCCTGGGAGG 


TGGGGGTGCA 


GTGACGACAG 


540 


ATGGGTGTGA 


CGGCTGCCAG 


ATTCCTGAGA 


CCCGCCCTGC 


GGTGGGGCTA 


CACCCAGCCA 


600 


GGGAGTCTCC 


AGAGGTGAGG 


CTGTTGTTTA AAAACCTGGA GCCGGGAGGG GAGACCCCCA 


660 


CATTCAAGAG 


GAGCTTTCAG 


GCGATCTGGA GAAAGAACGG 


CAGAACACAC 


AGCAAGGAAA 


720 


GGTCCTTTCT 


GGGGATCACC 


CCATTGGCTG 


AAG ATG AGA CCA TTC TTC CTC TTG 


774 



Met Arg Pro Phe Phe Leu Leu 

1 5 

TGT TTT GCC CTG CCT GGC CTC CTG CAT GCC CAA CAA GCC TGC TCC CGT 822 
Cys Phe Ala Leu Pro Gly Leu Leu His Ala Gin Gin Ala Cys Ser Arg 

10 15 20 

GGG GCC TGC TAT CCA CCT GTT GGG GAC CTG CTT GTT GGG AGG ACC CGG 870 
Gly Ala Cys Tyr Pro Pro Val Gly Asp Leu Leu Val Gly Arg Thr Arg 
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Phe Leu Arg Ala Ser Ser 

AO 45 

TGC ACC CAG TAT GGC GAG 

Cys Thr Gin Tyr Gly Glu 

60 

AGG CAG CCT CAC AAC TAC 

— »-*■-* tf»„^. 

Arg oiii 4.^.*- ~- - J 

75 

TCC TCC GGC CCC ATG CGC 

Ser Ser Gly Pro Met Arg 
90 

GTC TCT CTG CAG CTG GAC 

Val Ser Leu Gin Leu Asp 
105 

ATG ATG GAG TTC CAG GGG 

Met Met Glu Phe Gin Gly 

120 125 

TCC TCA GAC TTC GGT AAG 

Ser Ser Asp Phe Gly Lys 

140 

GAC TGC ACC TCC ACC TTC 

Asp Cys Thr Ser Thr Phe 
155 

TGG CAG GAT GTT CGG TGC 
Trp Gin Asp Val Arg Cys 
170 

CTA AAT GGG GGG AAG GTC 



105 

Thr Cys Gly Leu Thr Lys 

50 

TGG CAG ATG AAA TGC TGC 
Trp Gin Met Lys Cys Cys 

65 

TAC AGT CAC CGA GTA GAG 
q Pr His Are Val Glu 
80 

TGG TGG CAG TCC CAG AAT 
Trp Trp Gin Ser Gin Asn 
95 

CTG GAC AGG AGA TTC CAG 
Leu Asp Arg Arg Phe Gin 
110 115 
CCC ATG CCT GCC GGC ATG 
Pro Met Pro Ala Gly Met 

130 

ACC TGG CGA GTG TAC CAG 
Thr Trp Arg Val Tyr Gin 
145 

CCT CGG GTC CGC CAG GGT 
Pro Arg Val Arg Gin Gly 
160 

CAG TCC CTG CCT CAG AGG 
Gin Ser Leu Pro Gin Arg 
175 

CAA CTT AAC CTT ATG GAT 
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Pro Glu Thr Tyr 

55 

AAG TGT GAC TCC 966 
Lys Cys Asp Ser 
70 

AAT GTG GCT TCA 1014 
Asn Val Ala Ser 
85 

GAT GTG AAC CCT 1062 

Asp Val Asn Pro 

100 

CTT CAA GAA GTC 1110 
Leu Gin Glu Val 

CTG ATT GAG CGC 1158 
Leu lie Glu Arg 

135 

TAC CTG GCT GCC 1206 
Tyr Leu Ala Ala 
150 

CGG CCT CAG AGC 1254 
Arg Pro Gin Ser 
165 

CCT AAT GCA CGC 1302 

Pro Asn Ala Arg 

180 

TTA GTG TCT GGG 1350 
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106 

ATT CCA GCA ACT CAA AGT CAA AAA ATT CAA GAG GTG GGG GAG ATC ACA 
lie Pro Ala Thr Gin Ser Gin Lys He Gin Glu Val Gly Glu lie Thr 
200 205 210 215 

AAC TTG AGA GTC AAT TTC ACC AGG CTG GCC CCT GTG CCC CAA AGG GGC 1*46 
Asn Leu Arg Val Asn Phe Thr Arg Leu Ala Pro Val Pro Gin Arg Gly 

220 22 5 230 

tap r.AC. CCT CCC AGC GCC TAC TAT GCT GTG TCC CAG CTC CGT CTG CAG 149* 
Tyr His Pro Pro Ser Ala Tyr Tyr Ala Val Ser Gin Leu Arg Leu Gin 

235 240 245 

GGG AGC TGC TTC TGT CAC GGC CAT GCT GAT CGC TGC GCA CCC AAG CCT 1542 
Gly Ser Cys Phe Cys His Gly His Ala Asp Arg Cys Ala Pro Lys Pro 

250 255 260 

GGG GCC TCT GCA GGC CCC TCC ACC GCT GTG CAG GTC CAC GAT GTC TGT 1590 
Gly Ala Ser Ala Gly Pro Ser Thr Ala Val Gin Val His Asp Val Cys 

265 270 275 

GTC TGC CAG CAC AAC ACT GCC GGC CCA AAT TGT GAG CGC TGT GCA CCC 1638 
Val Cys Gin His Asn Thr Ala Gly Pro Asn Cys Glu Arg Cys Ala Pro 
280 285 2^0 295 

TTC TAC AAC AAC CGG CCC TGG AGA CCG GCG GAG GGC CAG GAC GCC CAT 1686 
Phe Tyr Asn Asn Arg Pro Trp Arg Pro Ala Glu Gly Gin Asp Ala His 

300 305 310 

GAA TGC CAA AGG TGC GAC TGC AAT GGG CAC TCA GAG ACA TGT CAC TTT 1734 
Glu Cys Gin Arg Cys Asp Cys Asn Gly His Ser Glu Thr Cys His Phe 

315 320 325 

GAC CCC GCT GTG TTT GCC GCC AGC CAG GGG GCA TAT GGA GGT GTG TGT 1782 
Asp Pro Ala Val Phe Ala Ala Ser Gin Gly Ala Tyr Gly Gly Val Cys 
330 335 340 
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345 

CTG CAC TAT TTC CGG AAC 
Leu His Tyr Phe Arg Asn 
360 365 
TGC ATC TCC TGC GAG TGT 
Cys lie Ser Cys Glu Cys 

TGT GAC CCA GTG ACC GGG 
Cys Asp Pro Val Thr Gly 

395 

GAG CGC TGT GAC CTA TGC 
Glu Arg Cys Asp Leu Cys 
410 

AAC CCG CAG GGC TGC CAC 
Asn Pro Gin Gly Cys His 
425 

AGG GAC ATG CCG TGT GAC 
Arg Asp Met Pro Cys Asp 
440 445 
AAC GTG GTG GGT CCC AAA 
Asn Val Val Gly Pro Lys 

460 

CTG GCC AGT GGC CAG GGC 
Leu Ala Ser Gly Gin Gly 

475 

TCC CTC AGC CCA CAG TGC 
Ser Leu Ser Pro Gin Cys 



107 

350 355 
CGG CGC CCG GGA GCT TCC 
Arg Arg Pro Gly Ala Ser 

370 

GAT CCG GAT GGG GCA GTG 
Asp Pro Asp Gly Ala Val 

385 

CAG TGT GTG TGC AAG GAG 
Gin Cys Val Cys Lys Glu 
400 

AAG CCG GGC TTC ACT GGA 
Lys Pro Gly Phe Thr Gly 
415 

CGC TGT GAC TGC AAC ATC 
Arg Cys Asp Cys Asn lie 
430 435 
GAG GAG AGT GGG CGC TGC 
Glu Glu Ser Gly Arg Cys 

450 

TGT GAC CAG TGT GCT CCC 
Cys Asp Gin Cys Ala Pro 

465 

TGT GAA CCG TGT GCC TGC 
Cys Glu Pro Cys Ala Cys 
480 

AAC CAG TTC ACA GGG CAG 
Asn Gin Phe Thr Gly Gin 
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ATT CAG GAG ACC 1878 

He Gin Glu Thr 
375 

CCA GGG GCT CCC 1926 

Pro Gly Ala Pro 
390 

CAT GTG CAG GGA 1974 

His Val Gin Gly 
405 

CTC ACC TAC GCC 2022 

Leu Thr Tyr Ala 

420 

CTG GGG TCC CGG 2070 

Leu Gly Ser Arg 

CTT TGT CTG CCC 2118 
Leu Cys Leu Pro 
455 

TAC CAC TGG AAG 2166 
Tyr His Trp Lys 
470 

GAC CCG CAC AAC 2214 
Asp Pro His Asn 
485 

TGC CCC TGT CGG 2262 
Cys Pro Cys Arg 
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Glu Gly Phe Gly Gly Leu Met Cys Ser Ala Ala Ala He Arg Gin Cys 

505 510 515 

CCA GAC CGG ACC TAT GGA GAC GTG GCC ACA GGA TGC CGA GCC TGT GAC 2358 
Pro Asp Arg Thr Tyr Gly Asp Val Ala Thr Gly Cys Arg Ala Cys Asp 
520 525 530 535 

TGT GAT TTC CGG GGA ACA GAG GGC CCG GGC TGC GAC AAG GCA TCA GGC 2406 
Cys Asp Phe Arg Gly Thr Glu Gly Pro Gly Cys Asp Lys Ala Ser Gly 

540 545 550 

CGC TGC CTC TGC CGC CCT GGC TTG ACC GGG CCC CGC TGT GAC CAG TGC 2454 
Arg Cys Leu Cys Arg Pro Gly Leu Thr Gly Pro Arg Cys Asp Gin Cys 



555 560 
CAG CGA GGC TAC TGC AAT CGC TAC CCG GTG TGC GTG GCC TGC CAC CCT 2502 
Gin Arg Gly Tyr Cys Asn Arg Tyr Pro Val Cys Val Ala Cys His Pro 

570 575 580 

TGC TTC CAG ACC TAT GAT GCG GAC CTC CGG GAG CAG GCC CTG CGC TTT 2550 
Cys Phe Gin Thr Tyr Asp Ala Asp Leu Arg Glu Gin Ala Leu Arg Phe 

585 590 595 

GGT AGA CTC CGC AAT GCC ACC GCC AGC CTG TGG TCA GGG CCT GGG CTG 2598 
Gly Arg Leu Arg Asn Ala Thr Ala Ser Leu Trp Ser Gly Pro Gly Leu 
600 605 610 615 

GAG GAC CGT GGC CTG GCC TCC CGG ATC CTA GAT GCA AAG AGT AAG ATT 2646 
Glu Asp Arg Gly Leu Ala Ser Arg He Leu Asp Ala Lys Ser Lys He 

620 625 630 

GAG CAG ATC CGA GCA GTT CTC AGC AGC CCC GCA GTC ACA GAG CAG GAG 2694 
Glu Gin He Arg Ala Val Leu Ser Ser Pro Ala Val Thr Glu Gin Glu 

635 640 645 

GTG GCT CAG GTG GCC AGT GCC ATC CTC TCC CTC AGG CGA ACT CTC CAG 2742 
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GGC CTG CAG CTG GAT CTG CCC CTG GAG GAG GAG ACG TTG TCC CTT CCG 
Gly Leu Gin Leu Asp Leu Pro Leu Glu Glu Glu Thr Leu Ser Leu Pro 

665 670 675 

AGA GAC CTG GAG AGT CTT GAC AGA AGC TTC AAT GGT CTC CTT ACT ATG 
Arg Asp Leu Glu Ser Leu Asp Arg Ser Phe Asn Gly Leu Leu Thr Met 

fiRS 69° 695 

680 t>8: ' 

... .„„ r.>n n.r ttt r.AA AAA ATA AGC AGT GCT GAT CCT 

1AA UAv» nww — 

Tyr Gin Arg Lys Arg Glu Gin Phe Glu Lys lie Ser Ser Ala Asp Pro 

700 705 710 

TCA GGA GCC TTC CGG ATG CTG AGC ACA GCC TAC GAG CAG TCA GCC CAG 
Ser Gly Ala Phe Arg Met Leu Ser Thr Ala Tyr Glu Gin Ser Ala Gin 

715 720 7 " 

GCT GCT CAG CAG GTC TCC GAC AGC TCG CGC CTT TTG GAC CAG CTC AGG 
Ala Ala Gin Gin Val Ser Asp Ser Ser Arg Leu Leu Asp Gin Leu Arg 

GAC AGC CGG AGA GAG GCA GAG AGG CTG GTG CGG CAG GCG GGA GGA GGA 
Asp Ser Arg Arg Glu Ala Glu Arg Leu Val Arg Gin Ala Gly Gly Gly 

745 750 755 

GGA GGC ACC GGC AGC CCC AAG CTT GTG GCC CTG AGG CTG GAG ATG TCT 
Gly Gly Thr Gly Ser Pro Lys Leu Val Ala Leu Arg Leu Glu Met Ser 

765 770 775 

760 763 

TCG TTG CCT GAC CTG ACA CCC ACC TTC AAC AAG CTC TGT GGC AAC TCC 
Ser Leu Pro Asp Leu Thr Pro Thr Phe Asn Lys Leu Cys Gly Asn Ser 

780 785 790 

AGG CAG ATG GCT TGC ACC CCA ATA TCA TGC CCT GGT GAG CTA TGT CCC 
Arg Gin Met Ala Cys Thr Pro lie Ser Cys Pro Gly Glu Leu Cys Pro 
795 



2790 



2838 



2886 



2934 



2982 



3030 



3078 



3126 



3174 



800 805 



PCT/JP97/03239 

WO 98/11217 

110 

820 



3270 



810 , 815 

AGG GCC GGT GGG GCC TTC TTG ATG GCG GGG CAG GTG GCT GAG CAG CTG 
Arg Ala Gly Gly Ala Phe Leu Met Ala Gly Gin Val Ala Clu Gin Leu 

825 830 835 

CGG GGC TTC AAT GCC CAG CTC CAG CGG ACC AGG CAG ATG ATT AGG GCA 3318 
Arg Gly Phe Asn Ala Gin Leu Gin Arg Thr Arg Gin Met lie Arg Ala 
had 845 850 855 

GCC GAG GAA TCT GCC TCA CAG ATT CAA TCC AGT GCC CAG CGC TTG GAG 3366 
Ala Glu Glu Ser Ala Ser Gin He Gin Ser Ser Ala Gin Arg Leu Glu 

860 865 870 

ACC CAG GTG AGC GCC AGC CGC TCC CAG ATG GAG GAA GAT GTC AGA CGC 3414 

Thr Gin Val Ser Ala Ser Arg Ser Gin Met Glu Glu Asp Val Arg Arg 

\ 

875 880 885 

ACA CGG CTC CTA ATC CAG CAG GTC CGG GAC TTC CTA ACA GAC CCC GAC 3462 
Thr Arg Leu Leu lie Gin Gin Val Arg Asp Phe Leu Thr Asp Pro Asp 

890 895 900 

ACT GAT GCA GCC ACT ATC CAG GAG GTC AGC GAG GCC GTG CTG GCC CTG 3510 
Thr Asp Ala Ala Thr He Gin Glu Val Ser Glu Ala Val Leu Ala Leu 

905 910 915 

TGG CTG CCC ACA GAC TCA GCT ACT GTT CTG CAG AAG ATG AAT GAG ATC 3558 
Trp Leu Pro Thr Asp Ser Ala Thr Val Leu Gin LyB Met Asn Glu He 
920 925 930 935 

CAG GCC ATT GCA GCC AGG CTC CCC AAC GTG GAC TTG GTG CTG TCC CAG 3606 
Gin Ala He Ala Ala Arg Leu Pro Asn Val Asp Leu Val Leu Ser Gin 

940 945 950 

ACC AAG CAG GAC ATT GCG CGT GCC CGC CGG TTG CAG GCT GAG GCT GAG 3654 
Thr Lys Gin Asp lie Ala Arg Ala Arg Arg Leu Gin Ala Glu Ala Glu 
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c a A1a Hie Ala Val Glu Gly Gin Val Glu Asp Val 
Glu Ala Arg Ser Arg Ala His Ala va± i 

975 980 
970 y/D 

GTT GGG AAC CTG CGG CAG GGG ACA GTG GCA CTG GAG GAA GGT CAG GAG 
Val Gly Asn Leu Arg Gin Gly Thr Val Ala Leu Gin Glu Ala Gin Asp 
985 990 " 5 

.... ,i„ r.w Thr Ser Arg Ser Leu Arg Leu lie Gin Asp Arg Val 

1005 « 10 1015 

1000 1UU0 

GCT GAG GTT CAG CAG GTA CTG CGG CCA GCA GAA AAG CTG GTG ACA AGC 
Aia Glu Val Gin Gin Val Leu Arg Pro Ala Glu Lys Leu Val Thr Ser 

m?5 1030 
1020 luz:) 

ATG ACC AAG CAG CTG GGT GAC TT« TGG ACA GGG ATG GAG GAG CTC CGC 

Met Thr Lys Gin Leu Gly Asp Phe Tr P Thr Arg Met Glu Glu Leu Arg 

X035 1040 1045 

CAC GAA GCC CGG CAG CAG GGG GCA GAG GCA GTG CAG GCC CAG CAG CTT 

His Gin Ala Arg Gin Gin Gly Ala Glu Ala Val Gin Ala Gin Gin Leu 

m55 1060 
1050 10:5:3 

Ma Glu Gly Ala Ser Glu Gin Ala Leu Ser Ala Gin Glu Gly Phe Glu 

1065 1070 1075 

AGA ATA AAA CAA AAG TAT GCT GAG TTG AAG GAC CGG TTG GGT CAG ACT 
Arg He Lys Gin Lys Tyr Ala Glu Leu Lys Asp Arg Leu Gly Gin Ser 



10Q0 1° 95 
1085 1UyU 



1080 

TCC ATG CTG GGT GAG CAG GGT GCC CGG ATC CAG AGT GTG AAG ACA GAG 
Ser Met Leu Gly Glu Gin Gly Ala Arg He Gin Ser Val Lys Thr Glu 



1105 1110 

1100 11UD 



GCA GAG GAG CTG TTT GGG GAG ACC ATG GAG ATG ATG GAC AGG ATG AAA 



3750 



3798 



3846 



3894 



3942 



3990 



4038 



4086 



4134 
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GAC ATG GAG TTG GAG CTdj CTG CGG GGC AGC CAG GCC ATC ATG CTG CGC 4182 
Asp Met Glu Leu Glu Leu^Leu Arg Gly Ser Gin Ala He Met Leu Arg 

1130 1135 1140 

TCA GCG GAC CTG ACA GGA CTG GAG AAG CGT GTG GAG CAG ATC CGT GAC 4230 
Ser Ala Asp Leu Thr Gly Leu Glu Lys Arg Val Glu Gin He Arg Asp 

1145 1150 H55 

CAC ATC AAT GGG CGC GTG CTC TAC TAT GCC ACC TGC AAG T 4270 
His He Asn Gly Arg Val Leu Tyr Tyr Ala Thr Cys Lys 
1160 1165 H70 

GATGCTACAG CTTCCAGCCC GTTGCCCCAC TCATCTGCCG CCTTTGCTTT TGGTTGGGGG 4330 
CAGATTGGGT TGGAATGCTT TCCATCTG&^JGGAGACTTTC ATGCAGCCTA AAGTACACCC A 390 
TGGACCACCC CTGGTGTGTA GCTAGTAAGA TTACCCTGAG CTGCAGCTGA GCCTGAGCCA 4450 
ATGGGACAGT TACACTTGAC AGACAAAGAT GGTGGAGATT GGCATGCCAT TGAAACTAAG 4510 
AGCTCTCAAG TCAAGGAAGC TGGGCTGGGC AGTATCCCCC GCCTTTAGTT CTCCACTGGG 4570 
GAGGAATCCT GGACCAAGCA CAAAAACTTA ACAAAAGTGA TGTAAAAATG AAAAGCCAAA 4630 
TAAAAATCTT TGGAAAAGAG CCTGGAGGTT CAACGAG 4667 

Sequence No . : 26 

Sequence length: 1086 

Sequence type: Nucleic acid 

Strandedness : Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species: Bomo sapiens 

Cell kind: Stomach cancer 

Clone name: HP10298 
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Existence site: 13tJ.. 506 
Characterization method: E 
Sequence description 

TTTAATTTCC CCGAAATCAG ACTGCTGCCT TGGACCGGGA CAGCTCGCGG CCCCCGAGAG 60 
CTCTAGCCGT CGAGGAGCTG CCTGGGGACG TTTGCCCTGG GGCCCCAGCC TGGCCCGGGT 120 
CACCCTGGCA TGAGGAG ATG GGC CTG TTG CTC CTG GTC CCA TTG CTC CTG 170 

r*i„ t ion t.pu Leu Val Pro Leu Leu Leu 
1 5 10 

CTG CCC GGC TCC TAC GGA CTG CCC TTC TAC AAC GGC TTC TAC TAC TCC 218 
Leu Pro Gly Ser Tyr Gly Leu Pro Phe Tyr Asn Gly Phe Tyr Tyr Ser 

15 / 20 25 

AAC AGC GCC AAC GAC CAG AAC CTA cic AAC GGT CAT GGC AAA GAC CTC 266 
Asn Ser Ala Asn Asp Gin Asn Leu Gly Asn Gly His Gly Lys Asp Leu 

30 35 40 

CTT AAT GGA GTG AAG CTG GTG GTG GAG ACA CCC GAG GAG ACC CTG TTC 314 
Leu Asn Gly Val Lys Leu Val Val Glu Thr Pro Glu Glu Thr Leu Phe 

45 50 55 

ACC CGC ATC CTA ACT GTG GGC CCC CAG AGC CTG GGG TCC GAA GCT TTG 362 
Thr Arg He Leu Thr Val Gly Pro Gin Ser Leu Gly Ser Glu Ala Leu 
60 65 70 75 

GCT TCC CCG ACC CGC AGA GCC GCT TGT ACG GTG TTT ACT GCT ACC GCC 410 
Ala Ser Pro Thr Arg Arg Ala Ala Cys Thr Val Phe Thr Ala Thr Ala 

80 85 90 

AGC ACT AGG ACC TGG GGC CCT CCC CTG CCG CAT TCC CTC ACT GGC TGT 458 
Ser Thr Arg Thr Trp Gly Pro Pro Leu Pro His Ser Leu Thr Gly Cys 

95 100 105 

OTA TTT ATT GAG TGG TTC GTT TTC CCT TGT GGG TTG GAG CCA TTT 503 
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Sequence description 
ACTCAGAAGC TTGGACCGCA TCCTAGCCGC CGACTCACAC AAGGCAGGTG GGTGAGGAAA 



560 



1086 
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TAACTGT TTTTATACTT CTCAATTTAA ATTTTCTTTA AACATTTTTT TACTATTTTT 

TGTAAAGCAA ACAGAACCCA aWcCCT TTGCTCCTGG ATGCCCCACT CCAGGAATCA 620 

TGCTTGCTCC CCTGGGCCAT TTGttGGTTTT GTGGGCTTCT GGAGGGTTCC CCGCCATCCA 680 

GGCTGGTCTC CCTCCCTTAA GGAGCSTTGGT GCCCAGAGTG GGCGGTGGCC TGTCTAGAAT 740 

GCCGCCGGGA GTCCGGGCAT GGTGGGCACA GTTCTCCCTG CCCCTCAGCC TGGGGGAAGA 800 

AGAGGGCCTC GGGGGCCTCC GGAGCTGGGC TTTGGGCCTC TCCTGCCCAC CTCTACTTCT 860 

^rs.r.rr r.r.TGAf.r.CCA GTCTGCCCAC TGAGGGGCTA GGGCTGGAAG CCAGTTCTAG 920 

GCTTCCAGGC GAAAGCTGAG GGAAGGAAGA AACTCCCCTC CCCGTTCCCC TTCCCCTCTC 980 

GGTTCCAAAG AATCTGTTTT GTTGTCATTT GTTTCTCCTG TTTCCCTGTG TGGGGAGGGG 1040 
CCCTCAGGTG TGTGTACTTT GGACAATAAA TGGTGCTATG ACTGCC 

( . 

Sequence No. : 27 
Sequence length: 866 
Sequence type: Nucleic acid 
Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 
Original source: 

Organism species: Homo sapiens 

Cell kind: Stomach cancer 
Clone name: HP10368 
Sequence characteristics: 

Code representing characteristics: CDS 

Existence site: 73.. 600 

Characterization method: E 
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10 



GCC CTC TCC TAC ACT CTG GCC AGA GAT ACC ACA GTC AAA CCT GGA GCC 
Ala Leu Ser Tyr Thr Leu Ala Arg Asp Thr Thr Val Lys Pro Gly Ala 



15 



20 



25 



AAA AAG GAC ACA AAG GAC TCT CGA CCC AAA CTG CCC CAG ACC CTC TCC 
Lys Lys Asp Thr Lys Asp Ser Arg Pro Lys Leu Pro Gin Thr Leu Ser 



40 



45 



AGA GGT TGG GGT GAC CAA CTC ATC TGG ACT CAG ACA TAT GAA GAA GCT 
Arg Gly Trp Gly Asp Gin Leu lie Trp Thr Gin Thr Tyr Glu Glu Ala 



50 



55 



60 



65 



70 



75 



TTG GAT GAG TGC CCA CAC AGT CAA GCT TTA AAG AAA GTG TTT GCT GAA 
Leu Asp Glu Cys Pro His Ser Gin Ala Leu Lys Lys Val Phe Ala Glu 



80 



85 



90 



AAT AAA GAA ATC CAG AAA TTG GCA GAG CAG TTT GTC CTC CTC AAT CTG 
Asn Lys Glu lie Gin Lys Leu Ala Glu Gin Phe Val Leu Leu Asn Leu 



95 



100 



105 



GTT TAT GAA ACA ACT GAC AAA CAC CTT TCT CCT GAT GGC CAG TAT GTC 
Val Tyr Glu Thr Thr Asp Lys His Leu Ser Pro Asp Gly Gin Tyr Val 



110 



115 



120 



125 



CCC AGG ATT ATG TTT GTT GAC CCA TCT CTG ACA GTT AGA GCC GAT ATC 
Pro Arg lie Met Phe Val Asp Pro Ser Leu Thr Val Arg Ala Asp lie 



130 



135 



140 



ACT GGA AGA TAT TCA AAC CGT CTC TAT GCT TAC GAA CCT GCA GAT ACA 
Thr Gly Arg Tyr Ser Asn Arg Leu Tyr Ala Tyr Glu Pro Ala Asp Thr 



159 



207 



255 



CTA TAT AAA TCC AAG ACA AGC AAC CCC TTG ATG ATT ATT CAT CAC 303 
Leu Tyr Lys Ser Lys Thr Ser Asn Lys Pro Leu Met lie lie His His 



351 



399 



447 



495 



543 
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Ala Leu Leu Leu Asp Asn Met L^ Lys Ala Leu Lys Leu Leu Lys Thr 

160 165 170 

GAA TTG TAAAGAAAAA AAATCTCCAA GCCCTTCTGT CTGTCAGGCC TTG 640 
Glu Leu 

/ 

175 ' \ 

AG AC TTG AAA CCAGAAGAAG TGTGAGAAGA CTGGCTAGTG TGGAAGCATA GTGAACAGAC 700 
TGATTAGGTT ATGGTTTAAT GTTACAACAA CTATTTTTTA AGAAAAACAA GTTTTAGAAA 760 
TTTGGTTTCA AGTGTACATG TGTGAAAACA ATATTGTATA CTACCATAGT GAGCCATGAT «ZU 
TTTCTAAAAA AAAAAATAAA TGTTTTGGGG GTGTTCTGTT TTCTCC 866 



